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PREFACE 


During  the  week  of  July  1-7, 1999,  the  Fourth  International  Conference  on 
Curves  and  Surfaces  was  held  in  Saint-Malo  (France).  It  was  organized  by  the 
Association  Frangaise  d ’Approximation,  (A.F.A.).  The  organizing  committee 
consisted  of  L.  Amodei  (Toulouse),  J.-L.  Bauchat  (Metz),  A.  Cohen  (Paris), 
J.-C.  Fiorot  (Valenciennes),  J.  Caches  (Toulouse),  G.-P.  Bonneau  (Grenoble), 
Y.  Lafranche  (Rennes),  P.-J.  Laurent  (Grenoble),  M.-L.  Mazure  (Grenoble), 
J.-L.  Merrien  (Rennes),  C.  Potier  (Paris),  C.  Rabut  (Toulouse),  P.  Sablonniere 
(Rennes),  L.L.  Schumaker  (Nashville),  C.  Vercken  (Paris). 

The  conference  was  attended  by  275  mathematicians  from  37  different 
countries,  and  the  program  included  10  invited  one-hour  lectures  and  190 
half-hour  research  talks  or  poster  presentations.  A  number  of  research  talks 
were  presented  in  eight  minisymposia  organized  by  W.  Dahmen,  R.  DeVore, 
D.  Donoho,  J.  Hoschek,  B.  Lacolle,  H.  Pottmann,  M.  Sabin,  and  J.  Stockier. 

The  proceedings  of  this  conference  consists  of  this  volume  (containing  43 
papers),  and  the  companion  volume  Curve  and  Surface  Design:  Saint-Malo 
1999  (containing  45  papers). 

We  would  like  to  thank  the  following  institutions  for  their  financial  or 
technical  support  and  their  contribution  to  the  success  of  this  conference: 
Ministere  de  I’Education  Nationale,  de  la  Recherche  et  de  la  Technologie; 
European  Office  of  Aerospace  Research  and  Development  (Air  Force  Office 
of  Scientific  Research,  United  States  Air  Force  Research  Laboratory);  Insti- 
tut  National  des  Sciences  Appliquees  de  Rennes;  Institut  d’lnformatique  et 
de  Mathematiques  Appliquees  de  Grenoble;  Conseil  Regional  de  Bretagne; 
Ministere  de  la  Defense  (contrat  No  9960014,  Direction  des  Systemes  de 
Forces  et  de  la  Prospective,  Service  de  la  Recherche  et  des  Etudes  Amont, 
Sous-direction  Scientifique,  Bureau  de  la  Prospective  Scientifique,  Delegation 
Generale  pour  PArmement);  Universite  Pierre  et  Marie  Curie  (Paris);  Labo- 
ratoire  de  Modelisation  et  Calcul  de  Grenoble;  Institut  National  des  Sciences 
Appliquees  de  Toulouse;  Universite  Joseph  Fourier  (Grenoble);  Vanderbilt 
University  (Nashville);  Ministere  des  Affaires  Etrangeres;  Matra  Datavision; 
Ecole  Nationale  Superieure  des  Arts  et  Metiers  de  Metz;  France  Telecom; 
Ecole  Nationale  Superieure  des  Telecommunications  (Paris);  Ecole  Centrale 
de  Nantes. 

Finally,  we  would  like  to  thank  Gerda  Schumaker  for  her  assistance  with 
the  preparation  of  the  proceedings. 
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Extending  Lawson’s  Algorithm  to  Include 
the  Huber  M-Estimator 


Iain  J.  Anderson,  John  C.  Mason, 
and  Colin  Ross 


Abstract.  When  fitting  a  curve  to  experimental  data,  there  is  no 
guarantee  that  the  data  obtained  are  as  accurate  as  might  be  expected. 
The  effect  of  outside  influences  may  cause  the  data  set  to  contain  outliers. 
These  outliers  can  have  a  significant  effect  on  any  curve  which  is  fitted 
to  such  data.^  The  £c»-norm,  which  is  particularly  appropriate  for  fitting 
data  with  uniformly  distributed  errors,  is  extremely  sensitive  to  such  out¬ 
liers,  since  it  minimises  the  maximum  error  from  the  data  to  the  curve. 
Therefore,  a  technique  which  approximates  a  data  set  using  the  .^oo-norm, 
without  being  adversely  affected  by  outliers,  would  be  a  useful  addition  to 
the  array  of  tools  available.  We  present  numerical  examples  to  illustrate 
the  use  of  such  a  technique  and  also  some  practical  applications  to  justify 
its  use. 


§1.  Introduction 

It  is  widely  accepted  that  the  ^oo-norm  is  the  most  appropriate  measure  of 
the  error  when  approximating  data  which  are  very  accurate  or  have  errors 
sampled  from  a  uniform  distribution.  Unfortunately,  because  the  norm  is 
extremely  sensitive  to  outliers,  it  is  not  suitable  for  use  in  fitting  experimental 
data  containing  such  points.  Nevertheless,  it  may  be  the  case  that  the  (.00- 
norm  is  the  most  appropriate  error  measure  for  the  non-outlying  data,  and  so 
we  present  an  algorithm  for  finding  an  ^oo  fit  to  the  non-outliers  of  a  data  set. 

The  algorithm  itself  is  based  on  a  combination  of  the  Huber  M-estimator 
[6]  and  Lawson’s  algorithm  [7].  There  is  considerable  literature  on  both  tech¬ 
niques  as  separate  subjects,  and  we  mention  here  only  a  selection.  Lawson’s 
algorithm  was  first  analysed  by  Lawson  [7]  in  1961,  and  was  later  studied 
by  Rice  and  Usow  [11],  Cline  [2]  and  Ellacott  [4].  Similarly,  the  Huber  M- 
estimator  was  developed  by  Huber  [6]  in  1964  and  has  received  considerable 
attention  in  the  form  of  algorithms  for  its  solution  as  well  as  analyses  of  its 
behaviour.  Papers  by  Clark  and  Osborne  [1],  Ekblom  [3],  Madsen  and  Nielsen 
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[9],  Michelot  and  Bougeard  [10]  and  Li  [8]  all  look  at  the  Huber  M-estimator 
either  in  its  own  right  or  as  one  of  a  class  of  robust  estimators. 

In  this  paper,  we  consider  the  problem  of  fitting  a  function  of  the  linear 
form  f{x)  =  to  a  set  of  data  where  the  {^j}  are  a 

set  of  basis  functions.  To  this  end,  we  minimise  the  residuals  Vi  =  yi  —  f{xi). 
What  our  algorithm  achieves  in  practice  is  to  obtain  an  ^oo  fif  those  Vi  such 
that  \ri\  is  less  than  the  Huber  parameter  7,  say,  and  effectively  to  ignore  the 
remaining  data. 

The  circumstances  that  require  such  an  algorithm  occur  in  practice,  par¬ 
ticularly  in  metrology  where  extremely  accurate  readings  can  be  obtained  (by, 
for  example,  a  CD  reader)  but  are  subject  to  the  occasional  outlier  (due,  for 
example,  to  optical  effects).  These  outliers  usually  only  appear  in  groups  of 
one  or  two,  so  they  are  isolated,  which  leads  to  an  easier  problem  than  if  they 
appeared  in  larger  groups.  Another  metrological  situation  where  this  algo¬ 
rithm  can  be  applied  is  in  the  measurement  of  a  cylinder  in  an  automotive 
engine  where  there  is  approximately  95%  very  accurate  data,  and  5%  outliers. 
Naturally,  these  problems  might  require  a  slightly  different  fitting  technique, 
but  this  algorithm  is  a  useful  starting  point  from  which  more  general  fitting 
procedures  may  be  developed  in  future  work. 


§2.  Background 

In  this  section,  we  discuss  some  aspects  of  both  Huber  estimation  and  Lawson’s 
algorithm.  In  the  next  section  we  describe  how  to  combine  the  two  techniques 
to  create  a  new  algorithm  which  satisfies  our  requirements. 


The  Huber  M-estimator 


The  Huber  M-estimator  is  based  on  the  Huber  function 


tV2,  if|i|<l, 
|t|-l/2,  iflt|>l, 


(1) 


introduced  by  Huber  [6]  in  1964,  and  is  defined  in  the  following  straightforward 
way: 

m 

(2) 

i=l 

where  Vi  is  the  residual  in  the  ith  datum,  and  7  is  the  Huber  threshold  defining 
the  distinction  between  “accurate”  and  “inaccurate”  data. 

There  are  several  algorithms  to  solve  the  problem  of  minimising  (2)  with 
respect  to  c,  several  of  which  are  described  by  Li  [8].  However  in  this  paper, 
we  limit  ourselves  to  the  Newton  method.  This  involves  solving  [8] 


■^A'^DAp  =  -A'^v 
T  7 

at  each  iteration,  where  A  is  the  design  matrix  with  entries  Aij  =  (}>j{xi),  D 
is  a  diagonal  matrix  with  entries  Da  =  1  if  |ri|  <  7  and  Du  =  0  if  |ri|  >7, 
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and  V  has  entries  Vi  =  p'iri/j).  Solving  this  system  gives  an  update  vector 
p  which  should  provide  a  better  estimate  c  +  p  of  the  solution  parameters 
c*.  In  order  to  ensure  convergence,  we  also  incorporate  a  line  search  which 
involves  finding  a  scalar  a  which  is  the  solution  to  the  equation 

iApf,'  =  0. 

Having  found  a,  we  then  obtain  a  new  estimate  of  c*  by  setting  c  :=  c  +  ap. 
We  repeat  this  procedure,  updating  D  and  v  as  necessary,  until  we  have 
obtained  c*  to  sufficient  accuracy. 

Weighting 

We  choose  to  generalise  (2)  by  introducing  weights  to  obtain  a  weighted  Huber 
M-estimator  of  the  form  F  =  '^ip{nh),  where  7  is  the  Huber  threshold, 

Wi  is  the  weight  associated  with  the  zth  datum,  and  Vi  is  the  residual  associated 
with  the  zth  datum.  It  may  be  necessary  to  introduce  weights  in  this  way  in 
order  to  deal  with  non-identically  distributed  errors  in  the  data,  in  which  case 
the  weights  may  be  chosen  to  be  the  reciprocals  of  the  standard  deviations  of 
the  underlying  probability  distributions. 

Many  algorithms  exist  to  find  unweighted  Huber  fits,  and  in  general, 
adapting  them  to  find  a  weighted  Huber  fit  is  a  straightforward  task.  As  an 
example,  we  show  how  to  adapt  a  Newton-like  method. 

Weighted  Huber  algorithm. 

1)  Calculate  Vi  =  Wip'{ri/'y). 

2)  If  ^A^DyjAp  =  -A'^v  is  consistent,  define  p  :=  -^{A^ Dy;vA)+A'^v, 

Otherwise,  define  p  :=  ~~P~^A^v,  where  P  is  a  positive  definite 
matrix. 

3)  Find  a  steplength  a  >  0  such  that  (Ap)^Pwp'((r  +  aAp)/^)  =  0. 

4)  Set  c  :=  c  -f  op. 

Here,  A  is  the  m  x  n  matrix  representing  the  underlying  linear  model,  is 
a  diagonal  matrix  with  entries 


{D^)ii  = 


Wi,  if  \rih\  <  1, 
0,  if  |ri/7|  >  1. 


P  is  usually  the  identity  matrix,  I  and  denotes  the  pseudo-inverse  of  a 
matrix  Y ,  defined  so  that  Y'^  is  that  matrix  X  of  the  same  dimensions  as  Y^ 
such  that  Y XY  ==  Y ,  XY X  =  X  and  Y X  and  XY  are  symmetric. 

We  note  here  that  there  are  many  other  algorithms  for  finding  a  Huber 
fit,  and  that  most,  if  not  all,  can  be  adapted  just  as  easily. 
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Lawson’s  algorithm 

This  algorithm,  analysed  by  Lawson  [7]  in  1961,  enables  an  fit  to  be  ob¬ 
tained  by  repeated  weighted  ^2  fits.  The  algorithm  itself  is  very  straightfor¬ 
ward,  and  involves  updating  the  weights  at  each  iteration  according  to 


(3) 


where  G{t)  =  \t\.  The  denominator  is  a  normalisation  term  to  ensure  that  the 
weights  sum  to  unity.  The  numerator  has  the  effect  of  weighting  data  with 
large  residuals  more  heavily,  with  the  result  that,  in  the  limit,  only  those  data 
with  a  maximal  residual  will  have  any  weight  attached  to  them. 

Lawson’s  algorithm  finds  the  points  of  extreme  oscillation  and  weights 
these  accordingly  to  obtain  the  best  Eco  approximation.  The  other  weights 
are  not  important,  and  in  fact  converge  to  zero. 

Initial  values  for  the  weights  are  usually  chosen  to  be  =  1/m,  as 
this  treats  all  the  data  equally  and  satisfies  the  condition  that  the  sum  of  the 
weights  must  be  unity.  Proofs  of  convergence  require  that  the  {(l>i{x)}  form 
a  Chebyshev  set,  but  experimental  results  (see,  for  example,  [4])  suggest  that 
the  algorithm  is  more  generally  applicable. 

A  summary  of  Lawson’s  algorithm. 

1)  Set  all  weights  equal  (with  the  sum  of  weights  equal  to  unity). 

2)  Perform  a  weighted  least-squares  fit. 

3)  Calculate  the  residuals  from  the  weighted  least-squares  fit. 

4)  Update  the  weights  according  to  Lawson’s  formula  (3). 

5)  Return  to  Step  2  until  convergence  is  obtained. 


§3.  The  Algorithm 

We  are  concerned  with  the  solution  of  the  problem 


min  max  \ri\, 

c  {T',:iri|<7} 


where  ri  is  the  residual  for  the  datum  {xi,yi),  and  7  is  the  Huber  threshold 
value.  In  order  to  solve  this  problem,  we  reformulate  it  as 

m 

miny^u;ip(ri/7), 

i=l 

where  p  is  defined  as  in  equation  (1),  and  we  adopt  an  iterative  procedure  to 
find  c  by  performing  successive  weighted  Huber  fits.  The  weights  are  updated 
after  each  iteration  in  a  manner  similar  to  Lawson’s  original  algorithm.  While 
Lawson’s  algorithm  is  concerned  with  finding  a  minimax  fit  via  a  sequence 
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of  weighted  least-squares  fits,  this  new  algorithm  finds  a  minimax  fit  to  the 
non-outlying  data  via  a  sequence  of  weighted  Huber  fits. 

Unfortunately,  Lawson’s  rule  for  updating  the  weights  cannot  be  used  in 
this  new  algorithm  since  the  rule  would  weight  the  outliers  too  heavily.  As 
a  result,  the  outliers  would  be  fitted  more  accurately  at  the  next  iteration. 
The  essential  point  of  Lawson’s  update  is  to  weight  those  datum  points  which 
correspond  to  the  maximal  errors  of  the  minimax  fit.  To  maintain  this  general 
trend,  we  need  an  update  in  which  the  function  G  in  (3),  rather  than  being 
monotonic,  instead  increases  to  a  peak  and  then  decays,  with  the  peak  corre¬ 
sponding  to  the  residual  with  the  largest  magnitude  which  does  not  exceed  7. 
The  latter  is  termed  the  “7-maximal  residual”  and  denoted  by  7m ij* 

The  function  we  choose  in  place  of  \t\  is  a  negative  exponential  of  the 
form 


\ti 


if  \t\  <  IMR, 
if  1^1  >  IMR, 


and  we  update  the  weights  at  each  iteration  according  to  (3).  (Note  that 
changes  with  the  iteration  /.) 

For  \t\  >  '^mr,  the  'yuR  factor  in  is  needed  to  ensure  continuity 

at  =  7mh  and  the  term  in  the  exponential  is  used  so  that  the 

left  and  right  derivatives  of  Gi{t)  are  continuous  at  jmr-  The  reason  for  this 
second  condition  is  to  ensure  that  points  with  residuals  just  over  Jmr  and 
those  with  residuals  just  less  than  7m a  are  treated  similarly. 


§4.  Convergence 

We  have  obtained  favourable  results  with  this  algorithm,  provided  that  certain 
conditions  are  met.  Firstly,  the  form  of  the  approximating  model  needs  to  be 
appropriate.  For  example,  trying  to  approximate  a  set  of  data  corresponding 
to  a  quadratic  by  a  straight  line  will  probably  lead  to  problems,  as  it  is  likely 
that  a  considerable  number  of  the  data  will  be  treated  as  outliers.  Secondly, 
7  needs  to  be  chosen  carefully.  If  7  is  chosen  to  be  too  small,  then  there  may 
be  many  solutions  and  it  may  not  be  possible  to  predict  to  which  solution  the 
algorithm  will  converge  —  if  it  converges  at  all. 

We  therefore  conclude  that  in  order  to  use  this  algorithm  effectively,  we 
first  need  to  have  some  details  of  the  problem  we  are  to  tackle.  If  we  are 
unsure  as  to  what  sort  of  model  to  fit  to  the  data,  then  7  should  be  chosen  to 
be  larger  than  we  might  initially  require.  If  we  are  unsure  what  value  of  7  to 
choose,  then  some  sort  of  7-reduction  procedure  may  be  effective  for  finding  an 
appropriate  value.  An  initial  value  of  7  may  be  chosen  by  use  of  the  formula 
7  =  1.9906  X  median(|ri  -  median(ri)|)  (see,  for  example,  Ross  et  al,  [12]). 

The  effect  of  using  a  Lawson-like  update  with  a  non-monotonic  factor  is  to 
increase  the  weights  at  the  extrema  of  the  minimax  approximation  and  reduce 
all  other  weights,  including  those  of  the  outliers.  In  practice,  the  algorithm 
produces  a  minimax  approximation  to  a  subset  of  the  data  with  the  aim  that 
this  subset  should  be  the  non-outlying  data.  Unfortunately,  we  have  been 
unable  thus  far  to  prove  convergence  for  this  algorithm.  However,  it  should 
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be  noted  that  the  convergence  rate  would  be  expected  to  be  similar  to  that  of 
Lawson’s  original  algorithm  as  they  essentially  do  the  same  task. 


§5.  Acceleration  Schemes 

Although  the  algorithm  as  it  stands  is  acceptable  for  small  problems,  it  nev¬ 
ertheless  takes  a  considerable  length  of  time  to  achieve  relaxed  convergence 
conditions.  This  is  no  surprise  as  one  of  the  shortcomings  of  Lawson’s  al¬ 
gorithm  is  its  slowness  to  converge.  More  specifically,  the  convergence  of 
Lawson’s  algorithm  is  linear  with  a  ratio  of  r*  [11],  where 


T*  =  max 


max  r* 


:  r  <  1 


with  r*  defined  to  be  the  vector  of  residuals  from  the  optimal  fif-  In 
many  situations,  this  ratio  can  be  very  close  to  one,  leading  to  rather  slow 
convergence. 

One  technique  to  increase  the  rate  of  convergence  is  to  use  the  fact 
that,  upon  convergence,  the  weights  corresponding  to  non-extremal  residu¬ 
als  are  zero.  Specifically,  after  a  set  number  of  normal  iterations  to  allow 
the  weights  to  settle  a  little,  we  may  set  =  0  if  jrij  <  cr^/l|r’i||oo5  where 

^  ■  This  latter  technique  is  the  one  presented  by  Rice 

and  Usow  [10],  although  Ellacott  [4]  found  that  it  could  cause  the  algorithm 
to  fail. 

Of  course  in  the  case  of  this  new  algorithm,  these  schemes  cannot  be 
applied  directly.  We  need  to  compensate  for  those  data  which  are  being  treated 
as  outliers,  thus  this  scheme  is  not  valid.  If  it  were  possible  to  find  some 
analogue  of  a  for  this  new  algorithm,  then  it  may  be  possible  to  use  that 
analogue  in  an  acceleration  scheme. 


§6.  Numerical  Results 

We  have  tested  this  algorithm  extensively  and  now  present  some  numerical 
results  to  illustrate  it.  In  Figure  1,  we  show  a  synthesised  data  set  consisting 
of  95  points  lying  close  to  the  polynomial  f{x)  =  —  3a:  +  1  with  5  outliers. 

Figure  1  also  shows  the  best  fitting  quadratic  polynomial  to  the  data  obtained 
by  a  least-squares  fit,  by  an  ^oo  fit  and  by  the  new  algorithm  presented  in  this 
paper.  The  noise  in  the  data  is  taken  from  a  uniform  distribution  on  [-0.1, 0.1] 
and  we  thus  choose  7  =  0.1.  Table  1  shows  the  results  from  the  various  fits 
performed.  It  is  clear  that  both  the  £2  and  £00  fits  are  unsuitable  and  are 
affected  by  the  outliers.  However,  the  new  algorithm  succeeds  in  identifying 
the  outliers  and  successfully  ignoring  them.  Comparing  the  results  from  the 
new  algorithm  with  those  from  performing  least-squares  and  minimax  fits  to 
the  data  without  outliers,  we  see  that  they  are  much  more  in  agreement.  In 
fact,  as  we  would  expect,  the  new  algorithm  has  generated  an  almost  identical 
fit  to  the  ioo  fit  on  the  accurate  data. 
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Various  quadratic  fits  to  data 


Fig.  1.  Various  quadratic  polynomial  fits  to  a  set  of  data  with  outliers. 

We  also  note  that,  while  the  new  algorithm  seems  to  be  significantly 
faster  in  this  example,  this  is  not  the  case  in  general.  In  fact  with  stricter 
convergence  criteria,  Lawson’s  original  algorithm  applied  to  the  non-outlying 
data  converges  in  fewer  iterations  than  the  new  algorithm.  The  reason  that 
the  minimax  fit  to  the  data  containing  outliers  takes  fewer  iterations  is  due  to 
the  result  in  Section  5  involving  r*,  which,  because  of  the  outliers,  is  actually 
quite  small  (r*  =  0.9497)  compared  to  r*  =  0.9894  for  the  case  of  the  new 
algorithm. 


4 

loo 

New 

4  (NO) 

^oc  (NO) 

Co 

+0.9499 

+0.9839 

+0.9958 

+1.0059 

+0.9971 

Cl 

-2.7868 

-0.4571 

-2.9951 

-3.0007 

-3.0001 

C2 

+2.1568 

+2.0492 

+2.0057 

+2.0050 

+2.0035 

Iterations 

1 

46 

39 

1 

140 

Tab.  1.  Numerical  results:  fitting  a  quadratic  (NO  :  No  outliers). 

The  convergence  criterion  was  the  same  for  both  Lawson-like  algorithms, 
namely  that  the  magnitude  of  the  four  largest  7-maximal  residuals  should 
agree  with  a  relative  error  of  less  than  10~^.  In  addition,  no  acceleration 
schemes  were  used  since  we  needed  to  obtain  a  measure  of  how  fast  the  algo¬ 
rithms  were  in  their  unaccelerated  form. 

§7.  Conclusions 

We  have  presented  an  algorithm  for  fitting  a  linear  form  to  data  containing  uni¬ 
form  noise,  contaminated  by  outliers.  Future  work  will  concentrate  on  three 
main  areas.  Firstly,  acceleration  of  the  convergence  of  the  algorithm.  Sec¬ 
ondly,  extension  to  non-linear  forms.  Thirdly,  extension  to  general  fp  norms 
rather  than  solely  to  the  ^00 -norm. 
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A  Segmentation  Method  under  Geometric 
Constraints  after  Pre-processing 

D.  Apprato,  J.  B.  Betbeder,  C.  Gout,  and  S,  Vieira- Teste 


Abstract.  For  a  geophysical  image  with  homogeneous  grey  levels,  we 
propose  a  method  of  segmentation  that  could  be  subdivided  into  two  parts: 
the  first  one  concerns  a  pre-processing  of  the  image  which  provides  an 
enhancement  of  some  features  present  on  the  image.  The  originality  of  the 
method  consists  in  using  a  scale  transformation  applied  to  the  pixel  values 
of  the  image.  The  second  part  presents  a  segmentation  method  using 
deformable  surfaces.  The  originality  of  this  segmentation  method  is  that 
it  considers  the  active  contour  model  as  a  set  of  articulated  curves,  which 
corresponds  to  the  interfaces  between  different  layers  and  faults.  Moreover, 
the  a  priori  knowledge  of  well  data  allows  us  to  make  some  geometric 
constraints  on  the  model.  The  solution  is  obtained  by  minimization  of  a 
nonlinear  functional  under  constraints  in  a  suitable  convex  set.  Solving  the 
minimization  problem  consists  in  particular  in  a  fc-order  Taylor  formula 
applied  to  linearize  the  nonlinear  term. 


§1.  Segmentation  Pre-processing 

Image  segmentation  is  one  of  the  most  important  steps  leading  to  the  analysis 
of  processed  image  data.  Its  main  goal  is  to  divide  an  image  into  parts  that 
have  strong  correlation  with  objects  or  areas  of  the  real  world  contained  in  the 
image.  The  image  is  divided  into  separate  regions  that  are  homogeneous  with 
respect  to  a  chosen  property  such  as  brightness,  color,  reflectivity,  context,  etc. 
However,  in  certain  cases,  the  grey  levels  of  an  image  could  be  homogeneous 
and  make  the  segmentation  more  difficult  to  realize.  This  is  particularly  true 
in  the  case  of  geophysical  and  medical  images  (cf.  [14,15]).  In  the  first  part  of 
this  work,  we  propose  a  method  to  solve  this  problem  using  families  of  scale 
transformations.  The  use  of  scale  transformations  is  common  in  imaging.  The 
aim  of  this  pre-processing  is  an  improvement  of  the  image  function  data  that 
suppresses  unwilling  distortions,  or  enhances  some  image  features  important 
for  further  processing.  It  provides  improvement  of  the  contrasts,  and  it  rep¬ 
resents  a  tool  to  pre-process  images  used  in  most  computer  algorithms  today. 
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According  to  Sonka,  Haclav  and  Boyle  [15],  the  pre-processing  of  images  may 
be  classified  into  four  categories  (pixel  brightness  transformations,  geometric 
transformations,  pre-processing  methods  that  use  a  local  neighborhood  of  the 
processed  pixel,  and  image  restoration  that  requires  knowledge  about  the  en¬ 
tire  image)  according  to  the  size  of  the  pixel  neighborhood  that  is  used  for  the 
calculation  of  a  new  pixel  brightness.  The  transformation  of  the  brightness 
and  of  the  contrast  of  an  image  allows  us  to  focus  on  phenomena  that  are  hard 
to  see  in  the  plain  image. 

For  a  given  image,  we  are  going  to  consider  the  pixel  values  as  a  topo¬ 
graphic  map:  the  brightness  value  of  each  pixel  is  the  height  of  the  (hyper-) 
surface  at  that  point.  For  a  data  set  of  pixels  {xi,yi,  Zi,  A  {xi,yi,  Zi))^,  we 
apply  the  following  functions: 

•  Cd:  A  (x^,t/,,z,)iC  [0,255]  ^[0,255], 

•  r‘'(v^do(<;doA))G/f”^(fl,]R), 

•  V'd  {^d  0  (q^>A)))  converges  to  C  o  A  when  d  converges  to  0, 

where  A  is  an  attribute  function  introduced  in  Section  2.1,  <;d  (resp.  (pd  and 
ijjd)  are  scale  transformations  converging  to  c  (resp.  p  and  •0),  and  is  a 
spline  operator  (see  Arcangeli  [2]). 

The  scale  transformation  q  converges  to  a  usual  brightness  transforma¬ 
tions  (;  (see  Apprato  and  Gout  [1]):  for  instance,  ^  could  be  a  scale  transfor¬ 
mation  which  enhances  the  image  contrast  between  brightness  values  pi  and 
P2- 

Let  us  consider  the  subdivision  {ui,  W2,  j,^d) 

interval  [0,255]  satisfying  (^(A{xi,yi,Zi))  =  ui,  p(d)  being  the  number  of 
different  pixel  values  of  the  image  (<  255  for  a  grey  scale  image).  The  function 
Cd  is  defined,  for  any  x  G  [A(xi,yi,Zi)  =  Wj,  A (a^j+i, Zj+i)  —  Wj+i],  and 
for  an  integer  1  <  i  <  p{d)  —  1,  by 

Cd(x)  =  Uiql^  [(a;  -  Wi)  /  {wi+i  -  u;^)]  -h  [{x  -  Wi)  /  (aui+i  -  w^)] 

-f  ai  {wi)  {wi+i  -  Wi)  [(a:  -  Wi)  /  {wi+i  -  ^n^)] 
ai  (wi+i)  (^e^+l  -  Wi)  q\,^  [(a:  -  Wi)  /  (^n^+i  -  't^i)] , 

where  the  for  Z  =  0,1,  and  j  =  1,  ...,m,  are  the  Hermite  finite  element 

basis  functions,  and  where  ai  (wj)  =  [uiJ^i—Ui)/  —  Wi)  and  ai  ('iep(d))  = 

(wp(d)  -  'Lep(d)-i)  /  (wp(d)  -  Up(d)_i).  Then,  Gout  [9]  showed  that  for  any  d  G 

D,  for  an  integer  i,  1  <i  <  p{d)  -  1,  Q  (w^i)  =  and  Q  ^  C"^([0, 255]). 

Likewise,  in  order  to  recover  a  finer  image,  it  is  useful  to  apply  the  “large 
variations”  algorithm  introduced  in  [9].  In  fact,  after  having  applied  the  func¬ 
tion  to  improve  the  contrast  of  the  image  (and  thus  increasing  the  variations 
of  the  corresponding  pixel  values),  it  is  very  useful  to  use  a  method  that  takes 
into  account  these  rapidly  varying  data.  Let  us  note  that  even  without  using 
the  scale  transformations  an  image  often  has  large  variations  (this  occurs 
for  example  when  a  dark  zone  is  near  a  brighter  one).  That  is  why  we  propose 
to  use  the  “Large  variations”  algorithm.  This  algorithm  uses  two-scale  trans¬ 
formations,  namely  tpd  for  the  pre-processing,  and  'ipd  for  the  post-processing. 
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The  first  one,  (pd,  is  used  to  suppress  the  oscillations  of  the  data.  The  pre¬ 
processing  function  (pd  is  such  that  the  data  do  not  present  large  variations, 
and  therefore  a  usual  spline  operator  (e.g.  [2])  can  subsequently  be  applied 
without  generating  significant  oscillations.  The  second  scale  transformation 
V'd  is  then  applied  to  the  approximated  values  to  map  them  back  and  obtain 
the  initial  approximated  pixel  values.  It  is  important  to  underline  that  the 
proposed  scale  transformations  do  not  create  parasitic  oscillations.  Moreover, 
this  method  is  applied  without  any  particular  knowledge  of  the  location  of  the 
large  variations  in  the  dataset. 

So,  for  pre-processing,  we  propose  two  algorithms:  in  the  first  one,  we  just 
apply  a  scale  transformations  Q  as  a  brightness  transformation  for  contrast 
enhancement,  in  the  second  one,  we  also  apply  the  “large  variations”  algorithm 
in  order  to  obtain  a  finer  represention  of  the  image  which  represents  the  main 
advantage  of  this  approach. 

The  reader  is  referred  to  [1,8,10]  for  a  complete  study  of  this  method, 
including  its  convergence  and  numerical  results.  Let  us  note  that  this  method 
is  also  efficient  for  noise  removal  as  shown  in  [1]. 


§2.  Segmentation  Method 

We  use  deformable  models  (external  forces,  evolution  term,  see  Kass,  Witkin 
and  Terzopoulos  [16,17])  and  classical  approximation  techniques  such  as  spline 
theory  (see  de  Boor  [3],  Laurent  [12],  Schumaker  [13])  and  the  finite  element 
method  [4]. 

We  propose  an  analytic  approach  which  uses  deformable  models  instead 
of  a  geometrical  one  as  done  for  instance  in  Sethian  [14].  We  recall  that  the 
principle  of  the  deformable  model  method  lies  in  attracting  the  representation 
towards  the  structure  using  forces: 

•  Internal  forces  describing  properties  of  elasticity  and  rigidity  of  the  rep¬ 
resentation,  connected  to  its  derivatives  (e.g.,  the  energy  of  thin  plates); 

•  External  forces  coming  from  potentials  which  characterize  the  elements 
of  the  structure  with  respect  to  the  attributes  data. 

Geometrical  constraints  are  associated  with  well  interpolation  conditions 
(case  of  geophysical  images  with  well  data).  Deformable  models  provide  a  way 
of  interactively  acting  on  the  representation  by  adding  a  dynamic  term  in  the 
minimization  problem  (see  for  instance  Cohen  and  Cohen  [5],  Cohen,  Cohen 
and  Ayache  [6],  and  Cohen,  Bardinet  and  Ayache  [7]),  that  permits  upgrading 
the  models  to  the  solution  of  the  minimization  problem  introduced. 

In  this  section,  we  first  give  the  geophysical  data  and  then  the  minimiza¬ 
tion  problem  is  studied.  The  nonlinear  problem  and  its  discretization  are 
given  in  the  subsequent  sections. 

2.1.  The  data  on  the  structure 

Two  types  of  data  are  available:  attribute  data  and  well  data.  For  each  at¬ 
tribute  A,  the  attribute  data  are  {xi,yi,Zi,A  {xi,yi,  Zi)) ,  where  {xi,yi,  Zi)  are 
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the  coordinates  of  the  barycentre  of  a  voxel,  and  A  {xi,yi,  Zi)  is  the  attribute 
value  A  in  this  voxel.  The  well  data  are  depth  data: 

{xj,yj,Zj)  ^  =  Oj  where  N  is  the  number  of  interpolation  points.  This 
model  allows’ a’ conceptual  representation  of  the  structure  by  identification 
of  its  various  elements,  and  permits  topological  connections  between  those 
elements.  This  model  induces  the  parameterization  of  the  structure.  Each 
element  of  the  structure  (layer,  fault,  etc.)  is  identified  with  a  connection  of 
four  points  with  a  label  S.  Furthermore,  each  quadruplet  is  connected  by  two 
points  (which  can  be  thought  as  a  common  side  of  a  “quadrilateral”  repre¬ 
sented  by  the  four  points)  with  another  quadruplet.  Practically,  the  a  priori 
model  can  be  constructed  by  introducing  a  3D  block  and  a  regular  grid  of  this 
block.  The  aim  is  to  find  a  space  of  admissible  representations  consistent  with 
the  a  priori  model  and  the  criteria  connected  with  the  data.  Therefore,  it  is 
necessary  to  choose  a  space  of  functions  characterized  by  a  domain  of  defini¬ 
tion  connected  with  the  a  priori  model  and  regularity  conditions  connected 
to  the  data.  The  idea  is  to  transform  the  a  priori  model  into  a  normalized 
model  called  the  model  of  reference  (denoted  by  M').  For  example,  we  can 
choose  M'  cTi=  [0, 1]  x  [0, 1]  x  [0, 1].  The  model  M'  is  then  the  image  by 
transformations  of  the  set  of  vertical  and  horizontal  closed  sides  of  the  a  priori 
model  as  done  in  [18].  Let  7  be  the  union  of  the  common  edges  of  any  two 
sides  of  M',  we  define  by  M  the  interior  of  M'  \  7.  All  the  functional  spaces 
needed  in  this  work  are  given  in  Vieira-Teste  [18] . 

2.2.  Minimization  criterion 

2.2.1.  Internal  forces:  The  criterion  a.ssociated  with  the  internal  forces 
is  a  classical  one.  Modelling  this  criterion  bring  us  to  the  following  energy 
functional:  for  any  v  eV  =  (M,  IR^)  fl  (7°  (M',Ill^), 

El  {v)  =  [v]I  m  +  H2,M  > 


with  (S)  >  0,  Vi  ^  1,2,  VE  e  M.  The  term  corresponds  to  an 

approximation  of  the  elastic  deformation  of  the  model  while  the  term  Iv]2^m 
corresponds  to  an  approximation  of  the  rigid  deformation  of  the  model  (cf. 
Cohen,  Cohen  and  Ayache  [6]). 
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2.2,2.  External  forces:  External  forces  are  issued  from  potentials  connected 
with  attributes.  We  introduce  the  following  energy,  for  any  v  eV, 

E2(v)=  ^  f  PE{v/^{s,r))dsdr, 


where  Fx;  is  the  potential  associated  with  the  element  parameterized  by  S. 

The  modelling  we  propose  consists  in  minimizing  the  previous  energies  Ei 
and  E2  as  we  will  see  in  subsection  2.2.4.  In  the  case  of  the  velocity  attribute, 
we  use  the  following  potential  to  define  the  layers: 


P{x,y,z) 


—a 


^(x,y,z) 


2 

) 


a  >  0 


where  A  is  the  attribute  “velocity  of  propagation  of  the  seismic  wave” . 


2.2.3.  Interpolation  data:  If  we  suppose  some  parameterization  (sj,rj)  E 

M  of  each  interpolating  point  aj  =  ^j)  is  known,  then  we  require  that 

V  eV  satisfies  v  (sj,rj)  =  aj  for  any  j  =  1, ...,  N. 

2.2.4.  Minimization  criterion:  Using  the  notation  and  definitions  intro¬ 
duced  above,  we  consider  the  functional  E  defined  on  V  by 

Eiv)  =  lv]l  n^  A  [v]I  m /  P'^{'t^/B{s,r))dsdr 

for  any  v  E  V.  We  consider  the  set  K  associated  with  the  interpolation 
constraints,  and  defined  by 

K  =  {veV,  Vi  =  1, ...,  N,  V  =  aj}. 

This  set  is  convex  and  closed  in  V.  We  also  introduce  the  following  linear 
mapping  (continuous  on  V  with  the  norm  ||'||2 

Pq:v  EV  poV  =  E  (K^)^  . 

We  consider  the  following  minimization  problem:  find  a  E  K  satisfying 

yvEK,  E{a)<E{v). 

We  note  that  this  problem  is  nonlinear  on  the  convex  set  K  with  respect 
to  a.  There  are  two  techniques  to  treat  this  problem.  The  first  one  consists  in 
linearizing  the  nonlinear  term  (linked  to  the  potentials)  in  the  functional  E. 
The  second  one  consists  in  using  the  deformable  models  technique  as  done  in 
the  following  subsection:  we  suppose  that  the  solution  is  a  function  of  time, 
which  leads  to  a  new  evolution  problem  that  will  be  discretized  both  in  time 
and  space. 
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2.3.  The  nonlinear  problem 

In  this  subsection,  we  give  the  nonlinear  minimization  problem  and  its  dis¬ 
cretization.  Let  us  recall  that  the  deformable  models  technique  consists  in 
assuming  that  cr  depends  on  time,  and  so  consists  in  adding  a  dynamic  term 
to  the  functional  E  (a) 


{t,s,r)dsdr, 

where  {e  =  e  (S)  >  0.  This  term  allows  the  control  at  each  time  of  the 

deformation  of  the  surfaces. 

2.3.1.  Evolution  problem:  Let  T  >  0.  We  note 

W{0,T,V)  =  |u;G  L^(]0,T],F),^  €  {]0,  T] ,  V')|  ■ 

We  then  consider  the  following  evolution  problem  defined  on  [0, T].  For  any 
t  e  ]0,r]  and  any  w  €  W{0,T,V),  find  cr  G  W{0,T,V),a  {t)  G  K,  satisfying 

(Pt): 

E{a)  +  -^  f  e(M)cr'^{t,s,r)dsdr<E{ui)  +  \^  f  £ (M) (t, s, r) dsdr, 


with 

cr(0)  =  croei^  (M,]R^)  . 

We  are  currently  studying  existence  and  uniqueness  of  (Pt)  using  a  Lipschitz 
approximation  of  the  sign  function. 

Likewise,  for  any  t  €  ]0,  T] ,  we  consider  the  term 

ECAf 

The  variational  formulation  of  the  problem  (Pt)  with  Kuhn  and  Tucker’s 
relation  is,  taking  as  test  function  v  on  the  stationary  space  V  (necessary 
condition  without  uniqueness),  for  any  t  G  ]0,T]  and  any  v  €V,  find  (cr,  A)  G 

W{0,T,V)  X  C°  ([O.T] ,  (IR^)^)  ,<r(«)  G  K,  satisfying  (P): 

f  e  (M)  T)dsdr  +  a(CT(«),  w)  +  (A  (<) ,  Po>’>m,3  =  (»') 

JM 

under  conditions 

tr(0)  =  croei^(lR^) 

A(0)  =  Ao€(]R’)'', 


and 
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where 


^a.(u,v)  =  [v]l^  +  lv]lM- 


2.3.2.  Discretization  in  time:  In  this  subsection,  we  discretize  (P)  both 
in  time  and  space.  The  originality  of  this  discretization  consists  in  using  a 
k-order  Taylor  development  which  allows  us  to  fake  into  account  many  more 
voxels  and  so  to  improve  the  convergence  of  the  method  (see  Vieira- Teste  [18] 
for  more  details).  We  cut  the  interval  ]0,T]  into  sub  intervals  with  length  At. 
Consider 

tm  =  mAt^  m  =  l,...yDT- 

We  use  the  following  approximation  of  the  time  derivative: 

d(T  ^  (T  (tm)  ~  a  (tm-l) 

dt  ^  ~  At 

Assuming  that  =  a{tm)  and  =  A(tm)5  we  approximate  the 
variational  problem  as  follows:  For  any  m  =  1,...,Z)t  and  any  v  e  V,  find 
((t"*,  A">)  eVx  (IR^)"  ,<7™  €  K,  satisfying  (P„); 

f  e  (M)  a^vdsdr  +  At  ?;)  -|-  {X'^,pQv)j^  3 

Jm  ^  ’ 


^vdsdr  -f  AtL 


with  0-°  =  <70  e  (M,]R^)  and  A"  =  Aq  €  (E,^)" . 

The  previous  problem  is  implicit  and  nonlinear  with  respect  to  the  so¬ 
lution  We  propose  to  replace  (?;)  =  (^m)  by  a  Taylor  series 

expansion  of  order  A:  >  0  about  the  time  t^-  We  suppose  that  a  is  in 
([0,  T] ,  (M,  IR3))  .  We  have 


Lcrm  (v)  —  Ijcr,v  (^m) 

and  Z<or,t;  (tm)  —  (tm-l)  +  AtDLfj^y  (tnx-l)  +  {tm-l)  H + 

D^Lf^^y  (tm-l)-  We  note  that  the  problem  (Pm)  is  linear  and  explicit 
with  respect  to  cr”^.  The  following  result  is  based  on  the  Lax-Milgram  Lemma. 

Theorem.  The  problem  (Pm)  fias  a  unique  solution  (cr^,  X^) . 

2.3.3,  Discretization  in  space:  Let  be  a  nonempty  bounded  subset  in 
IR!!j.  for  which  0  is  an  accumulation  point.  For  any  h  E  H,  we  solve  the  min¬ 
imization  problem  (Pm)  in  the  finite  element  space  (14)^  C  V.  The  generic 
finite  element  are  the  Hermite  finite  element  of  class  for  snakes  and  the 
Bogner-Fox-Schmit  finite  element  rectangle  of  class  (see  [4])  for  deformable 
surfaces.  To  have  (14)^  C  V,  it  is  necessary  to  have  a  connection  on  7.  To 
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do  that,  it  is  sufficient  to  divide  some  degrees  of  liberty  connected  to  deriva¬ 
tives  as  done  in  [18]. 

We  denote  by  (oj", the  coordinates  of  in  the  basis  of  14  and 
by  (Af, A^)  the  coordinates  of  A”^  {Mh  =  dim(14)).  If  is  a  solution 
of  the  discretized  problem  (Pm)  ffi  we  can  write  in  the  basis  of 

{Vhf;  Vm  =  l,...,DT,  V«  =  1,2,3, 

Mh 

j=l 

with  {aj^y  e  IR  and  where  the  functions  of  14- 

In  the  following,  we  miss  out  q  and  h.  Taking  u  =  in  (P),  we  have 
to  solve  (for  q  =  1,2,3,  in  the  linear  problem  (P))  a  system  of  {Mh  +  N) 
equations  with  {Mh  +  N)  unknowns.  We  easily  show  that  this  system  has  a 
unique  solution,  and  that  the  matrix  R—[C,  P,*  B,  0]  (first  line  :  C,  P;  second 
line  :  *P,  0)  of  the  system  is  symmetrical  and  sparse  with 

“  1^3  ’  (v^j »  5  Ij 

Bj^i=  lSt'(pj{Si,Ti)  ,  j=l,...,M/,,  Z=1,...,A^, 

where  for  any  G  (14)^  , 

K'^1om=  /  e{M)-u{s,T)-v{s,r)dsdT, 

’  JM 

and  where  T  =  (of, unknown  vector.  We  obtain  a 
linear  system  RT  =  L,  where  the  lines  of  L  are 

f  £{M)(T'^~^(pidsdr  +  AtL^m  {(pi) , 

JM 


+  AtLam  {(fMh  •)  1 


Atai, 


Ataj\j. 

This  method  has  been  implemented  in  fortran,  C  and  C-i--f .  Numerical  ex¬ 
amples  on  real  data  are  given  in  [11,18]. 
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Building  Adaptive  Multiresolution  Schemes 
within  Harten’s  Framework 


Francesc  Arandiga  and  Rosa  Donat 


Abstract.  We  consider  the  cell-average  framework  described  by 
A.  Harten  in  [5],  and  build  the  prediction  operator  using  two  nonlinear 
interpolation  techniques.  We  test  the  resulting  nonlinear,  adaptive,  mul¬ 
tiresolution  scheme,  and  compare  it  with  a  linear  scheme  of  the  same 
accuracy.  The  nonlinear  prediction  processes  we  develop  can  also  be  used 
in  the  context  of  iterative  refinement.  Numerical  tests  show  that  this  is 
also  a  viable  alternative  for  piecewise  smooth  data. 


§1.  Introduction 

The  goal  of  a  multi-scale  decomposition  of  a  discrete  set  of  data  is  a  “re¬ 
arrangement”  of  its  information  content  in  such  a  way  that  the  new  discrete 
representation,  exactly  equivalent  to  the  old  one,  is  more  “manageable”  in 
some  respects.  Some  of  the  best  known  applications  of  multi-scale  decomposi¬ 
tions  derive  from  their  compression  capabilities:  a  multiresolution  representa¬ 
tion  of  a  function,  i.e.,  of  a  discrete  set  which  represents  the  function  in  some 
sense,  can  be  highly  compressed  with  minimal  loss  of  information  content. 
Precisely  because  of  this  potential,  multi-scale  techniques  have  an  emergent 
role  in  numerical  analysis,  where  the  multi-scale  idea  has  been  used  success¬ 
fully  over  the  years,  from  multigrid  techniques  to  hierarchical  bases  in  finite 
element  spaces  or  subdivision  schemes  in  Computer-Aided  Design  (CAD). 

In  the  late  80’s  and  early  90’s,  ideas  from  all  these  fields,  together  with 
a  wide  experience  in  the  numerical  solution  of  Hyperbolic  Conservation  Laws 
(HCL)  lead  Ami  Harten  to  develop  a  General  Framework  for  multiresolu¬ 
tion  representation  of  discrete  data.  The  building  blocks  of  a  multiresolution 
scheme  a  la  Harten  are  two  operators  which  connect  discrete  and  continuous 
data;  The  discretization  operator  obtains  discrete  information  from  a  given 
signal  (belonging  to  a  particular  function  space)  at  a  given  resolution  level; 
the  reconstruction  operator  produces  an  approximation  to  that  signal  (in  the 
same  function  space)  from  its  discretized  values. 
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Kartell’s  point  of  view  is  that  the  way  in  which  the  discrete  data  is  gener¬ 
ated,  i.e.,  the  discretization  process,  determines  its  nature  and  should  provide 
an  adequate  setting  for  a  multiresolution  analysis.  Once  the  setting  is  spec¬ 
ified,  the  choice  of  an  appropriate  reconstruction  operator  provides  the  key 
step  to  the  construction  of  a  multiresolution  scheme. 

The  reconstruction  process  lies  at  the  very  heart  of  a  multiresolution 
scheme  built  a  la  Harten,  and  adaptivity  can  be  introduced  in  the  multires¬ 
olution  scheme  at  this  level.  A  nonlinear,  adaptive  reconstruction  technique 
which  fits  the  approximation  to  the  local  nature  of  the  data  will  lead  to  a 
nonlinear  adaptive  multiresolution  algorithm  with  improved  compression  ca¬ 
pabilities. 

The  aim  of  this  study  is  to  examine  a  particular  class  of  nonlinear  adaptive 
multiresolution  schemes,  those  using  the  Essentially  Non  Oscillatory  (ENO) 
interpolatory  techniques  of  [6]  in  the  reconstruction  step.  Numerical  experi¬ 
ments  [1,2]  show  that  ENO-MR  schemes  have  larger  compression  rates  than 
linear  ones  when  the  original  signal  or  image  is  composed  of  smooth  parts 
joined  together  by  singularities.  ENO  techniques  can  be  used  to  construct 
very  accurate  interpolants,  which  in  turn  lead  to  multiresolution  schemes  with 
high  compression  capabilities.  When  the  original  signal  is  geometric,  nonlin¬ 
ear  schemes  can  be  used  as  loss-less  compression  techniques,  and  we  show 
some  application  of  this  in  the  last  section  of  this  paper. 

The  nonlinear  prediction  process  can  be  used  also  in  the  context  of  sub¬ 
division  refinement.  This  amounts  to  setting  to  zero  all  scale  coefficients  and 
using  the  prediction  operator  to  proceed  by  dyadic  refinement.  Preliminary 
tests  show  that  these  nonlinear  subdivision  schemes  lead  to  non-oscillatory 
limiting  functions  when  applied  to  piecewise  smooth  data  with  jumps,  and 
open  up  an  interesting  alternative  for  iterative  refinement  of  piecewise  smooth 
data. 


§2.  Cell  Average  Multiresolution  Analysis 

When  dealing  with  discrete  data  coming  from  a  piecewise  smooth  function, 
the  simplest  discretization  process,  that  of  considering  the  point-values  of  the 
function,  might  not  be  well  defined,  especially  at  jump  discontinuities.  On 
the  other  hand,  the  discretization  by  cell- averages  procedure  acts  naturally 
on  the  space  of  integrable  functions,  and  it  provides  a  more  adequate  setting 
to  deal  with  piecewise  smooth  signals.  Because  of  this,  we  shall  carry  out  our 
numerical  study  within  the  cell-average  framework. 

Images  are  considered  here  as  two-dimensional  signals,  and  we  use  the 
usual  tensor-product  approach  to  design  our  two-dimensional  algorithms. 
Thus,  we  only  describe  the  essential  features  of  the  one-dimensional  setting 
for  the  sake  of  completeness.  The  interested  reader  can  find  the  missing  details 
in  this  section  in  [2]  or  [5]. 

Let  us  consider  a  set  of  nested  dyadic  grids  in  [0,1]: 

x'l  =  ihk,  /(fc  =  2-ViVo.  JVfc  =  2‘iVo,  fc  =  L,...,0, 
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where  Nq  is  some  integer.  The  discretization  by  cell  average  operator  is  defined 
as  follows: 


Vk-.L\0,1]^V’’,  =  =  f(x)dx,  l<i<Nk,  (1) 


where  T^[0, 1]  is  the  space  of  absolutely  integrable  functions  in  [0, 1]  and 
is  the  space  of  sequences  with  Nk  components. 

Due  to  the  relation 


fT^  =  T^  f  =  ^  I"  + fi). 

hfc-i  ^ 


it  is  ecisy  to  see  that  A:  =  T  - 1, . . . ,  0,  can  be  evaluated  directly  from 

without  using  explicitly  (1)  (i.e.,  without  knowledge  of  the  original 
function  f{x)). 

To  define  an  appropriate  reconstruction  operator  for  this  setting  (in  fact, 
a  whole  family  of  them),  we  consider  the  sequence  {F-^}  on  the  k-th.  grid 
defined  from  the  cell  values  {fi}  as  follows: 


*  _ 

Ft  =  hi,'£,fl=  f{x)dx  =  F{xl, 
s=i  Jo 


fl  = 


pk  _  pk 

i  *  — 1 


(2) 


The  function  F{x)  (g  C[0,  1])  is,  in  fact,  a  primitive  of  the  original  function 
/(x),  and  the  sequence  {F*}  represents  a  point  value  discretization  of  F(x) 
on  the  k-th  grid  (with  Fq  —  0).  Notice  that  (2)  establishes  a  one-to-one 
correspondence  between 

Let  us  denote  by  X(x]F^~^)  an  interpolatory  reconstruction  of  the  set 
on  the  grid  i.e.,  I(Xi~^;F^~^)  =  Then,  we  obtain  an 

approximation  ,  /*,  to  using  (2)  as  follows: 


Since  fjt  =  obtain 


(3) 


fLi  =  and  S^i  =  -2:(x5i_i,  F'=-')). 

Let  us  define  the  prediction  errors  as  ef  :=  -  f^.  Using  (2)  and  (4),  we 

easily  obtain 

/li-l  “  /li-l  =  ~(f2i  ~  f2i)’ 


Thus,  we  can  simply  store  only  the  prediction  errors  with  odd  indexes;  these 
are  the  scale  coefficients,  df  =  e2i-i>  multiresolution  transform. 
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The  multiscale  decomposition  of  the  original  data  is  described  by  the 
encoding  algorithm: 

{Do  k  =  L, . . .  ,1 

fT^  =  l{fli-i  +  fli)  l<i<iV.-i  (5) 

4  =  /‘i-i  -  i<i< 

We  recover  the  original  data  with  the  decoding  algorithm: 

{Do  k_  =  1, . . . ,  L 

In  our  study  we  consider  only  local  interpolation  techniques  with  La- 
grangian  polynomials,  i.e., 

2:(x;F*)  =  ?i(x;F‘)  xe[4i,4. 

where  qi(x]  F^)  is  a  polynomial  of  degree  r  satisfying  F^)  =  F^_i  and 

When  the  stencil  of  points  used  to  construct  qi{x)  is  symmetric  around 
the  iih  interval  (i.e.,  r  =  2s- 1,  <S  =  •  •  ’ ,  we  obtain  a  centered 

interpolation  technique.  Centered  interpolation  techniques  are  very  often 
used  in  approximation  theory  because  they  minimize  the  interpolation  error, 
thus  leading  to  very  accurate  reconstructions  of  smooth  signals.  It  turns  out 
that  the  multiresolution  schemes  obtained  from  (5)  and  (6)  with  Lagrangian 
piecewise  polynomial  centered  interpolation  techniques  are  equivalent  to  the 
Biorthogonal  Wavelet  (BOW)  schemes  of  [4]  (with  the  box  function  as  the 
scaling  function). 

The  compression  properties  of  BOW  schemes  have  been  widely  analyzed 
in  the  literature,  but  from  an  approximation  theory  standpoint,  it  is  very 
easy  to  study  the  behavior  of  the  coefficients  in  terms  of  the  smoothness  of 
the  underlying  signal  and  the  accuracy  of  the  interpolation  technique.  Notice 
that  the  scale  coefficients  df  are  related  to  interpolation  errors  at  the  odd 
points  of  the  fc-th  grid.  In  fact, 

Thus,  if  f{x)  is  sufficiently  smooth  at  we  have  d^  =  0(/i^_i). 

However,  the  presence  of  an  isolated  singularity  x^  G  induces  a 

loss  of  accuracy  in  the  polynomial  pieces  whose  stencils  cross  the  singularity. 
The  accuracy  loss  is  related  to  the  strength  of  the  singularity  as  follows  [2]:  if 

=  0(1)  {p  <  r),  and  /  is  smooth  everywhere 

else,  we  have 


d 


k  _ 
I  — 


0(/i^_i),  otherwise. 


(7) 
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Thus,  centered  interpolation  techniques  lead  to  relatively  large  regions  of 
poor  accuracy  around  singularities,  and  therefore  to  large  detail  coefficients 
at  those  locations  where  the  accuracy  loss  takes  place.  The  consequence  is  a 
loss  in  efficiency  for  the  multiresolution-based  compression  scheme. 

It  seems  reasonable  to  improve  the  efficiency  of  the  multiresolution-based 
compression  scheme  by  improving  the  accuracy  of  the  interpolatory  technique 
used  in  the  reconstruction  step.  Notice  that  when  the  convex  hull  of  the  sten¬ 
cil  used  to  construct  a  polynomial  interpolant  is  contained  within  a  region  of 
smoothness  of  the  underlying  signal,  the  interpolation  error  (and  the  corre¬ 
sponding  detail  coefficient)  becomes  small.  Thus,  it  is  clear  that  the  key  point 
is  the  construction  of  polynomial  pieces  that  avoid  the  singularity. 

In  the  literature  related  to  the  numerical  solution  of  conservation  laws, 
where  discontinuities  can  spontaneously  develop,  we  find  an  interpolation  pro¬ 
cedure  with  all  the  features  we  need;  Essentially  Non  Oscillatory  (ENO)  in¬ 
terpolatory  techniques  [6]  (it  is  not  surprising  that  Harten  was  one  of  the 
developers  of  these  techniques). 

ENO  interpolatory  techniques  lead  to  piecewise  polynomial  interpolants 
that  are  fully  accurate  except  in  those  intervals  that  contain  singularities. 
The  essential  feature  of  ENO  interpolatory  techniques  is  a  stencil  selection 
procedure  that  attempts  to  choose  each  stencil  Si  within  the  same  region  of 
smoothness  of  F{x).  The  stencil  selection  process  uses  the  divided  differences 
of  the  discrete  set  to  be  interpolated  as  smoothness  indicators:  Large  divided 
differences  indicate  a  possible  loss  of  smoothness.  The  selection  process  is 
such  that  it  tends  to  look  away  from  large  gradients,  when  this  is  feasible. 

ENO  interpolatory  techniques  are  nonlinear,  because  the  stencil  used 
to  construct  each  polynomial  piece  depends  on  the  function  being  interpo¬ 
lated.  When  the  singularities  are  sufficiently  well  separated  (this  means  that 
there  are  at  least  r  +  1  points  in  each  smoothness  region),  ENO  techniques 
lead  to  stencils  such  that  (assuming  the  singularity  is  located  at  the  zth  cell) 
Si-i  n«Si+i  —  0.  Hence,  the  detail  coefficients  satisfy 

om)hu,  i=i, 

^  y  otherwise. 

Thus  ENO  interpolants  have  a  nearly  optimal  high  accuracy  region,  which 
should  in  turn  improve  the  efficiency  of  the  corresponding  multiresolution- 
based  compression  algorithms. 

The  case  of  a  corner  of  /  (i.e.,  a  jump  in  f')  is  especially  interesting 
because  it  is  possible  to  construct  an  even  better  (in  terms  of  local  accuracy) 
interpolant:  the  ENO-SR  interpolant. 

The  Subcell  Resolution  (SR)  technique  (also  due  originally  to  Harten  [3]) 
allows  us  to  obtain  an  approximation  to  the  location  of  an  isolated  corner  in  a 
continuous  function  up  to  the  order  of  the  truncation  error.  The  approximated 
value  is  then  used  to  modify  locally  (in  the  interval  where  the  discontinuity 
lies)  the  definition  of  the  piecewise  polynomial  interpolant  in  such  a  way  that 
the  interpolation  error  is  small  except  for  an  0(/'i^+^)  band  around  the  corner. 
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Fig.  1.  Left  original,  right  coarse  version. 


Fig.  2.  Left  linear,  right  ENO. 

Recall  that  in  the  cell-average  setting,  the  interpolation  process  is  applied 
to  the  primitive  function.  Since  jumps  in  f{x)  become  corners  in  F{x),  using 
an  ENO-SR  interpolant  in  the  reconstruction  step  lead  to  detail  coefficients 
satisfying 

di  =  0{hl_i),  except  when  x^i^i  ~  Xd-  (9) 

The  SR  technique  is,  thus,  appropriate  to  increase  the  efficiency  of  mul¬ 
tiresolution-based  compression  algorithm  for  piecewise  continuous  functions 
with  jumps. 


§3.  Numerical  Experiments 

Let  us  consider  a  purely  geometric  image  as  shown  in  Figure  1  (left),  and 
apply  to  it  the  tensor-product  version  of  algorithm  (5).  We  consider  piecewise 
polynomial  interpolants  of  degree  4,  thus  the  accuracy  of  the  reconstruction  in 
the  cell-average  framework  is  3.  In  Figure  2  we  display  the  location  of  non-zero 
scale  coefficients  in  the  multiresolution  representation  of  the  signal.  When 
using  the  ENO-SR  technique,  and  because  all  discontinuities  are  “aligned 
with  the  (tensor-product)  grid” ,  all  scale  coefficients  are  zero.  This  is  a  direct 
consequence  of  the  fact  that  the  ENO-SR  reconstruction  commits  no  error  at 
the  odd  points  in  each  one  of  the  resolution  levels  considered  (T  =  4  in  this 
example).  In  the  case  of  the  ENO  scheme,  the  scale  coefficients  at  the  highest 
resolution  level  are  all  zero.  This  is  a  consequence  of  the  nature  of  the  data 
(the  point- values  of  the  signal  at  the  highest  resolution  level),  which  locates 
all  discontinuities  at  the  cell  end-points.  The  ENO  technique  is  perfectly 
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Fig.  4.  Reconstruction  from  the  coarse:  linear,  ENO,  ENO-SR. 


accurate  when  the  discontinuity  is  located  at  a  grid  point.  The  technique  is 
fully  accurate  at  all  the  lower  resolution  levels  except  at  the  interval  where 
the  discontinuity  is  located.  Thus,  there  is  only  one  scale  coefficient,  located 
at  the  point  which  is  closest  to  the  singularity.  This  should  be  compared 
with  the  3  scale  coefficients  per  singularity  obtained  with  the  linear  scheme 
(the  lines  showing  the  location  of  non-zero  scale  coefficients  are  thicker  for 
the  linear  scheme).  For  the  sake  of  comparison,  the  number  of  non-zero  scale 
coefficients  in  each  case  is:  Linear  8554,  ENO  1688,  ENO-SR  0. 

We  turn  next  to  the  nonlinear  subdivision  scheme  obtained  by  considering 
the  ENO  and  ENO-SR  in  the  prediction  process.  In  Figure  3  we  show  a 
univariate  process.  Starting  from  the  the  cell-averages  of  a  piecewise  smooth 
function  at  a  very  coarse  level  (16  points),  we  proceed  by  dyadic  refinement 
until  we  obtain  1024  data.  The  numerical  results  clearly  indicate  that  no 
overshoots  or  undershoots  are  obtained  with  the  non-linear  techniques.  Again, 
the  excellent  properties  of  the  ENO-SR  technique  in  terms  of  approximation 
lead  to  the  best  results. 

In  Figure  4  we  show  a  simple  multivariate  test,  the  reconstruction  of  the 
geometric  figure  considered  before  from  a  very  coarse  representation  (right  in 
Figure  1).  The  Gibbs- like  oscillations  typical  of  linear  schemes  in  the  presence 
of  discontinuities  lead  to  the  blurring  of  the  edges  observed  in  Figure  4  (left). 
There  is  no  blurring  in  the  reconstructed  image  obtained  with  the  nonlinear 
techniques.  Again,  the  ENO-SR  scheme  leads,  in  this  simple  case,  to  the 
exact  original  image.  One  dimensional  cuts  of  the  reconstructed  figures  are 
displayed  in  Figure  5. 
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Fig.  5.  Horizontal  cuts  linear,  ENO,  ENO-SR. 
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A  Recursive  Approach  to  the  Construction  of 
k-Balanced  Biorthogonal  Multifilters 


Silvia  Bacchelli,  Mariantonia  Cotronei, 
and  Damiana  Lazzaro 


Abstract.  In  this  paper  we  discuss  some  numerical  aspects  of  a  par¬ 
ticular  construction  of  balanced  biorthogonal  multifilters  by  means  of  the 
lifting  scheme.  This  construction  allows,  by  simply  solving  linear  equa¬ 
tions,  to  obtain  multifilters  which  do  not  need  prefiltering,  and  for  which 
the  discrete  versions  of  polynomial  preservation/annihilation  properties, 
are  respectively,  satisfied  by  their  low  and  high-pass  branches.  In  par¬ 
ticular,  we  conduct  experiments  on  how  a  parameter  which  appears  in 
our  recursive  definition  of  balancing  can  be  chosen  to  suitably  influence 
the  spectral  behaviour  of  the  multifilter  low-pass  branch,  making  it  more 
effective  in  image  compression  problems. 


§1.  Introduction 

Multiwavelets  are  a  new  addition  to  the  classical  scalar  wavelet  theory,  and 
have  been  extensively  studied  in  the  last  six  years  [5,9,11,17].  The  main  mo¬ 
tivation  for  multiwavelets  is  that,  unlike  the  scalar  wavelet  case,  they  can 
simultaneously  possess  desirable  properties  which  are  found  to  be  useful  for 
image  compression  applications,  such  as  orthogonality  and  symmetry,  short 
support,  linear  phase,  a  high  approximation  order,  a  high  number  of  vanish¬ 
ing  moments,  etc.  This  combination  would  not  be  possible  in  any  real- valued 
scalar  wavelet.  In  fact,  all  real- valued  scalar  wavelets,  with  only  one  scaling 
function  and  one  mother  wavelet,  can  never  possess  all  the  above  properties 
at  the  same  time.  This  flexibility  of  vector-valued  wavelet  functions  is  due 
to  the  fact  that  multiwavelets  satisfy  conditions  in  which  matrix  rather  than 
scalar  coefficients  are  involved. 

However,  multiwavelets  lack  some  attributes  that  scalar  wavelets  possess, 
and  this  becomes  apparent  when  one  implements  the  discrete  multiwavelet 
transform.  In  particular,  in  the  scalar  case,  a  scaling  low-pass  filter  with  an 
approximation  order  k  refers  to  the  ability  of  the  low-pass  filter  to  reproduce 
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discrete-time  polynomials  up  to  a  degree  k-1,  while  the  corresponding  wavelet 
high-pass  filter  annihilates  discrete-time  polynomials  up  to  the  same  degree. 
This  property,  which  is  very  important  in  many  applications,  does  not  hold 
in  the  multiwavelet  case.  In  fact,  the  approximation  power  property  does 
not  assure  the  preservation  and  annihilation  of  discrete-time  polynomials  by 
the  low-pass  and  the  high-pass  branch  of  a  multi  wavelet-based  filter  bank, 
respectively. 

Moreover,  because  the  approximation  order  for  multiwavelets  is  not  ac¬ 
companied  by  the  additional  properties  mentioned  above,  in  applications  using 
multiwavelets,  a  preprocessing  or  prefiltering  step  is  necessary  to  obtain  an 
efficient  signal  or  image  compression.  A  detailed  investigation  of  prefiltering 
methods  can  be  found  in  the  literature  [12,21,7]. 

Recently,  to  overcome  these  problems,  Lebrun  and  Vetterli,  and  Selesnick 
[15,18]  introduced  the  concept  of  balanced  multiwavelets.  They  constructed 
orthogonal  multiwavelet  bases  whose  multifilter  coefficients  satisfy  the  discrete 
version  of  the  approximation  and  zero-moments  properties,  and,  at  the  same 
time,  avoid  the  use  of  prefilters  in  implementing  the  discrete  multiwavelet 
transform.  This  is  a  great  advantage  because  the  preprocessing  step  is  a 
crucial  point  in  multiwavelet-based  algorithms.  In  fact,  this  initialization  can 
sometimes  destroy  the  very  properties  a  multi  wavelet  basis  is  designed  to 
have.  Nevertheless,  the  above  authors’  construction  of  orthogonal  balanced 
multifilters  implies  the  resolution  of  non-linear  equations  that  are  solved  by 
the  Grobner  basis  method. 

Following  the  previous  authors’  idea,  in  order  to  avoid  the  difficulties  due 
to  the  above-mentioned  non-linearity,  in  [2]  we  have  given  a  simple  algebraic 
construction  of  A:-balanced  biorthogonal  multifilters  making  use  of  the  well- 
known  tool  called  the  lifting  scheme.  As  shown  in  [19],  the  lifting  scheme  pro¬ 
vides  a  simple  method  for  constructing  new  biorthogonal  filters  with  requested 
properties,  starting  from  an  assigned  set  of  biorthogonal  analysis-synthesis  fil¬ 
ters.  In  [2]  we  have  extended  the  lifting  scheme  to  the  multifilter  case,  and  in 
so  doing,  we  have  exploited  the  additional  degrees  of  freedom  left  in  the  multi¬ 
filter  construction  after  satisfying  the  perfect  reconstruction  condition  in  order 
to  easily  construct  finite  ^-balanced  multifilters.  Our  results  have  been  stated 
using  the  algebraic  framework  of  banded  block  recursive  matrices,  exploiting 
this  flexible  mathematical  tool  to  translate  both  the  fc-balancing  conditions 
and  other  desirable  properties  in  terms  of  simple  linear  conditions  on  the 
multifilter  coefficients. 

In  this  paper,  we  discuss  some  numerical  aspects  of  the  procedure  for  the 
construction  of  biorthogonal  balanced  multifilters  given  in  [2] ,  and  analyze  in 
particular  the  effect  of  the  choice  of  the  shift  constant  p  which  appears  in 
our  definition  of  fc-balancing  on  the  compression  capabilities  of  this  kind  of 
filters.  In  fact,  shift  constant  p  plays  an  important  role,  and  it  can  be  used  as 
a  further  degree  of  freedom. 

Starting  from  Lazy  multifilters,  ^-balanced  multifilters  of  order  2  and  4 
are  constructed,  and  their  effectiveness  in  image  compression  is  tested  on  the 
Lena  image.  Using  numerical  experiments,  we  observe  that  the  p  parameter 
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influences  the  shape  of  the  Fourier  transform  of  the  scalar  filters  associated 
with  the  low-pass  matrix  coefficients,  and  we  determine  the  value  of  p  in  such 
a  way  that  the  spectral  behaviour  of  the  newly  constructed  low-pass  filters  is 
as  close  as  possible  to  the  optimal  shape.  With  this  selection  of  p,  we  obtain 
the  best  compression  results. 

We  remark  that  the  aim  of  this  paper  is  essentially  to  show  the  flexibility 
of  our  tool  in  building  multifilters  which  do  not  need  prefiltering  and  which 
are  easily  found  by  solving  simple  linear  equations. 

§2.  Balanced  Biorthogonal  Multifilters 

Let  {H  =  Zi  Hit\W  =  Zi  and  {H  =  Hit\  W  =  Ei  Wif)  be  two 
pairs  of  block  Laurent  polynomials  associated,  respectively,  with  the  analy¬ 
sis  and  j^e  synthesis  phase  of  a  FIR  multifilter  bank,  where  {Hi},  {Wi}, 
{Hi},  {ITi}  are  finite  sequences  of  r  x  r  matrices.  In  the  following  section  we 
will  refer  to  {H,  W},  {W,  W}  as  analysis  multifilters  and  synthesis  multifilters, 
respectively,  where  we  can  think  of  them  either  as  the  sequences  of  matrix 
coefficients  or  as  their  associated  block  Laurent  polynomials. 

Let  R(t‘^,'H),R{t^,yV),R(t‘^,H),R{t‘^,W)  be^he  block  banded  Hurwitz 
matrices  whose  generating  functions  are  H,  W,  H,  W,  respectively.  With  these 
matrices,  we  can  give  an  algebraic  description  of  the;  action  of  the  analysis- 
synthesis  system  on  a  block  Laurent  polynomial  a  given  as  input,  in  the 
following  way: 

Analysis: 

Synthesis: 

[a]  =  [<.(“)]  + 

where  and  represent  the  output  of  the  analysis  phase,  while  a  repre¬ 
sents  the  output  of  the  synthesis  phase  and  therefore  of  the  whole  FIR  system. 
Given  any  pair  of  multifilters  {W,  W},  define  the  2-decimated  matrix 

A  _\no  Hi' 

whose  elements  are  the  2-decimated  block  series  related  to  7i,  W,  that  is 

Hk  =  '£H2i+kt\  Wk  =  fc  =  0,1. 

i  i 

Definition  1.  We  say  that  the  pairs  {H,  W},  {H,W}  are  biorthogonal  mul¬ 
tifilters  or  duals  to  each  other  or,  equivalently,  that  they  satisfy  the  Perfect 
Reconstruction  (PR)  property  if 
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In  this  case,  if  furthermore  H  and  H  admit  a  convergent  subdivision 
scheme,  then  it  is  possible  to  define  corresponding  multiscaling  functions  and 
multiwavelets  from  the  well-known  matrix  two-scale  relations: 

^x)  =  ^/2  R{t\  n)  $(2x),  i(x)  =  V2  R{t^,  H)  i{2x), 

9{x)  =  V2R{t^,W)^2x),  §(®)  =  ^/2/^(t^W)$(2x), 

where  ^ (x),  ^ (a;)  represent  the  vector  containing  the  translates  of 

the  r-vectors  ^  =  [(/►O)  •  •  •  V’  =  [V’o,  •  •  • ,  V'r-i]^*  ^  =  [0o,  •  •  •  ? 

We  now  extend  the  concept  of  balancing  order  (introduced  in  [15])  to 
biorthogonal  multifilters.  We  require  that  the  multifilters  associated  with 
the  analysis  system  must  satisfy  the  discrete  versions  of  both  the  polynomial 
preserving  and  zero  moment  properties. 

Definition  2.  A  pair  of  multifilters  {H,  W}  related  to  the  analysis  phase  of 
a  FIR  system  is  said  to  be  balanced  of  order  k  (or  k-balanced),  if  there  exists 
at  least  one  real  number  p  such  that  the  following  relations  hold: 

R(t^,n)xl7rn]  =  V2  2^l{7r  +  p)nl 

n  =  0, . . . ,  /c  -  1,  (1) 

i^(t^>v)xK]  =  o, 

where  [tTh]  and  [{7r-fp)„]  are  bi-infinite  column  vectors  which  can  also  be  seen 
as  r-block  vectors  associated  with  the  formal  block  series 


(ri)" 

(ri  -h  p)^ 

."  =  E 

(ri +  1)" 

(’r  +  p)"  =  E 

(ri  +  1  +  p)" 

i 

.{ri  -hr  -  1)" . 

i 

.(ri  -f  r  —  1  -f  p)” . 

In  [2]  an  equivalent  condition  to  (1)  has  been  given  which  turns  out  to  be 
more  useful  in  practice: 

Theorem  3.  A  pair  of  FIR  multifilters  {W,  W}  is  balanced  of  order  k  if  and 
only  if 


(2) 


{rj)" 

'  P"  1 

h 

E^i 

[rj  +  1)" 

=  V22" 

(p+1)" 

j=h 

.{rj  +  r  -  1)". 

.(p  +  r-  1)". 

r 

(n)" 

1 

E  W'i 


j=mi 


{rj  +  1)- 


l{rj  -f  r  -  1)"  J 


=  0, 


for  n  =  0, . . . ,  A:  -  1. 


(3) 
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§3.  Construction  with  the  Lifting  Scheme 

The  lifting  scheme  (introduced  by  Sweldens  [19])  is  a  flexible  tool  for  the  con¬ 
struction  of  biorthogonal  bases.  In  [2]  an  extension  to  the  multifilter  setting 
has  been  given.  In  short,  given  a  set  {H.H.W.W}  of  biorthogonal  multifil¬ 
ters,  then  the  new  multifilters 

in"""  =^H^{Sot^)w 
=  ot^)n, 

where  i^s  any  block  Laurent  polynomial,  gives  rise  to  a  new  set  , 

y^new^  >V}  of  biorthogonal  multifilters. 

Analogously,  by  simply  changing  the  roles  of  the  previous  multifilters, 

^nA(sot‘^)w 
w  =w-{s  ot^)n 

gives  rise  to  a  new  set  W,  W,  of  biorthogonal  multifilters. 

We  call  (4)  and  (5)  respectively  the  lifting  scheme  and  the  dual  lifting  scheme. 

In  [2]  some  useful  conditions  are  given  which  allow  the  new  multifilters 
to  inherit  symmetry/antisymmetry  properties  from  the  starting  multifilters. 

We  can  take  advantage  of  the  previous  scheme  to  construct  new  balanced 
biorthogonal  multifilters.  In  fact,  unlike  the  orthogonal  case  where  the  balanc¬ 
ing  and  the  orthogonal  conditions  give  rise  to  non-linear  equations,  which  in 
[15,18],  for  example,  are  solved  with  a  Grdbner  basis  approach,  our  balancing 
conditions  (2)  and  (3)  applied  to  the  lifted  (or  dual  lifted)  multifilters  give  rise 
to  linear  conditions.  The  main  steps  of  our  approach  are: 

1)  Construct  the  new  low-pass  multifilter  coefficients,  using  the  dual  lifting 
scheme; 

2)  Apply  the  balancing  condition  (2),  and  solve  the  linear  equations  to  find 
the  coefficients  of  the  unknown  dual  lifting  matrix  polynomial; 

3)  Construct  the  new  high-pass  multifilter  coefficients,  with  the  lifting 
scheme; 

4)  Apply  the  balancing  condition  (3),  and  solve  the  linear  equations  to  find 
the  coeflftcients  of  the  unknown  lifting  polynomial; 

5)  Construct  the  corresponding  dual  low  and  high-pass  multifilters. 

It  is  important  to  note  that  in  applying  the  balancing  condition  (2), 
a  value  must  be  assigned  to  the  shift  parameter  p.  In  our  experiments,  it 
turns  out  that  p  influences  on  the  effectiveness  of  the  multifilters  in  their 
applications. 
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Fig.  1.  \h^\  +  |/i^|  associated  to  the  Lazy  2-balanced  low-pass  multifilter  with  a 
varying  p:  from  left  to  right,  p  =  —1, 0.25, 1. 


§4.  Examples 


In  the  following  example,  we  start  from  Lazy  multifilters  to  obtain  balanced 
biorthogonal  multifilters  of  order  2  and  4,  which,  furthermore,  are  of  the  type 
symmetric/antisymmetric  (see  [2]). 

We  restrict  to  the  case  r  =  2,  and  define  Lazy  multifilters  as  follows. 


27i  =  H  =  \/2/,  W  =  2W  =  V2It. 


In  order  to  show  the  influence  of  p  on  the  performance  of  the  new  multifilters, 
we  introduce  the  notation 


m,n  =  0,l. 


which  give  the  two  low-pass  scalar  filters  hP  —  —  {^fc}fc€ZZ  ob¬ 

tained  by  reorganizing  the  set  of  2  x  2  low-pass  matrix  multifilter  {Hk}keZj 
as  a  multichannel  scalar  filter  bank. 

As  shown  in  the  following  figures,  the  shift  constant  p  influences  the  shape 
of  the  Fourier  transforms  of  making  them  more  or  less  suitable 

for  application  problems. 

In  Figure  1,  we  show  3  graphs  of  the  sum  +  with  p  varying 

in  {—1, 1/4, 1}.  It  can  be  seen  that  the  choice  p  =  1/4  gives  visually  a  better 
low-pass  behaviour.  In  this  case  the  new  coefficients  (except  for  a  factor  \/2) 
are 
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Fig.  2.  \h^  \  +  associated  to  the  Lazy  4-balanced  low-pass  multifilter  with  a 
varying  p:  from  left  to  right,  p  =  0, 0.4, 0.8. 


A  second  example  (Fig.  2)  shows  the  behaviour  of  the  scalar  low-pass 
filters  associated  to  the  Lazy  4-balanced  multifilters,  with  different  choices  for 
the  parameter  p.  In  this  case,  the  choice  p  =  0.4  provides  the  best  behaviour 
of  the  low-pass  filters. 

We  have  experimented  with  the  above  multifilters  in  an  image  compres¬ 
sion  example  (on  the  Lena  image),  by  making  use  of  a  multiwavelet-based  em¬ 
bedded  coding  [6] .  Results  obtained  with  the  best  choices  of  2  and  4-balanced 
Lazy  multifilters  are  compared,  at  same  compression  ratio  1:16,  with  those 
produced  by  Chui-Lian  (CL)  [5]  and  Geronimo-Hardin-Massopust  (GHM)  [9] 
multifilters.  CL  and  GHM  multifilter  both  have  approximation  order  2,  but 
need  prefiltering.  For  comparison  purposes,  this  prefiltering  step  has  been 
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Fig.  3.  Reconstructions  of  Lena  compressed  with  different  bases.  From  the  top 
left  corner:  2-balanced  Lazy  multifilter  with  p  =  0.25;  4-balanced  Lazy 
multifilter  with  p  =  0.4;  CL  without  prefiltering;  GHM  without  prefilter¬ 
ing. 


G.R. 

2-bal.  Lazy 

4-bal.  Lazy 

CL  without  pref. 

GHM  without  pref. 

16 

26.61 

28.12 

11.79 

21.72 

Tab.  1.  PSNR  values  (in  dB)  with  different  multifilters. 


omitted,  in  order  to  show  how  a  prefiltering  is  absolutely  necessary  when 
dealing  with  non-balanced  multifilters. 

These  results  are  shown  in  Table  1.  Figure  3  shows  the  reconstruction 
of  Lena  compressed  with  the  above-mentioned  bases.  It  can  be  seen  from  the 
table  and  the  figure  that  2-balanced  Lazy  multifilters  behave  better  than  CL 
and  GHM.  Better  results  are  of  course  achieved  by  the  4-balanced  multifilters. 

In  the  above  experiments,  we  have  not  taken  into  account  the  orthonor¬ 
mal  balanced  multiwavelets  of  Lebrun- Vetterli  [16]  (which  definitely  give  the 
best  results,  due  to  their  good  spectral  properties),  since  our  aim  was  not  to 
construct  the  best  possible  filters,  but  to  show  the  flexibility  of  our  tool  in 
building  multifilters  which  do  not  need  prefiltering  and  which  are  easily  found 
by  solving  simple  linear  equations.  One  can  obtain  more  effective  filters  with 
this  procedure  by  extending  the  length  of  the  lifting  polynomials,  and  by  using 
one  of  the  many  well-known  good  strategies  for  filter  construction. 
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Fast  Evaluation  of  Radial  Basis  Functions: 

A  Multivariate  Momentary  Evaluation  Scheme 

R.  K.  Beatson  and  E.  Chacko 


Abstract.  This  paper  presents  a  scheme  for  fast  evaluation  of  splines, 
or  “radial”  basis  functions,  of  the  form  s(-)  =  p(-)  +  ~  ^i)- 

Here  p  is  a  low  degree  polynomial  and  $  :  IR”  — ♦  IR  is  a  function  that 
need  not  be  radial.  This  multivariate  momentary  evaluation  scheme  is 
a  generalization  of  the  fast  multipole  method  in  which  calculations  with 
far  field  expansions  are  replaced  by  calculations  involving  moments  of  the 
data.  The  primary  advantage  of  this  new  algorithm  is  that  it  is  highly 
adaptive  to  changes  in 


§1.  Introduction 

This  paper  presents  a  scheme  for  fast  evaluation  of  splines,  or  “radial”  basis 
functions,  of  the  form 


A)  =  P(-)  +  EAi#(.-.0-  (1) 

i=l 

Here  p  is  a  low  degree  polynomial,  and  $  :  H”  IR  is  a  function  that  need  not 
be  radial.  This  multivariate  momentary  evaluation  scheme  is  a  generalization 
of  the  fast  multipole  method  in  which  calculations  with  far  field  expansions 
are  replaced  by  calculations  involving  moments  of  the  data.  The  primary 
advantage  of  this  new  algorithm  is  that  it  is  highly  adaptive  to  changes  in  $. 
More  precisely,  changing  to  a  new  basic  function  $  only  requires  coding  a  one 
or  two  line  function  for  the  (slow)  evaluation  of  In  contrast,  adapting  a 
conventional  fast  multipole  code  to  a  different  $  requires  considerable  analysis 
of  appropriate  expansions  and  transformation  theorems,  followed  by  writing 
a  very  specific  code.  The  current  algorithm  reduces  the  incremental  cost 
of  a  single  extra  evaluation  from  0{N)  to  0(1)  operations,  and  the  cost  of 
a  matrix-vector  product  (that  is,  evaluation  at  all  centres)  from  0(N'^)  to 
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0{N  log  N)  operations.  The  algorithm  can  be  viewed  as  a  spline  fitter  in 
that  the  approximation  it  produces  is  a  piecewise  polynomial  supplemented 
by  appropriate  direct  evaluations.  The  method  described  is  a  multivariate 
generalisation  of  the  method  of  [4] . 

In  outline  the  setup  phase  of  method  is  as  follows.  Firstly,  space  is  divided 
in  a  hierarchical  manner.  For  example,  in  a  2-D  setting  an  initial  square 
could  be  divided  into  a  quadtree.  Then  centres  are  associated  with  the  panels 
they  lie  in.  Next,  proceeding  up  the  tree  level  by  level,  the  moments  of  the 
coefficients  (the  A^’s)  about  panel  centres  are  calculated.  Next  working  down 
the  tree  for  each  level,  a  number  of  approximations  to  $  are  formed.  Then  for 
each  panel  within  a  level,  polynomial  approximations  to  that  part  of  s  due  to 
far  away  centres  are  formed  by  combining  moments  and  the  approximations 
to  The  evaluation  phase  first  identifies  the  childless  panel  to  which  the 
evaluation  point  belongs.  Then  it  approximates  the  far  field  by  evaluating 
the  polynomial  associated  with  that  panel,  and  adds  to  that  approximation 
the  directly  calculated  near  field  part  of  s(x).  For  reasons  of  space,  we  will 
not  detail  suitable  methods  for  subdividing  space,  or  the  process  of  evaluation. 
These  matters  are  well  understood  in  the  context  of  the  fast  multipole  method, 
see  for  example  [2]. 

The  paper  is  organized  as  follows.  The  necessary  mathematics  for  form¬ 
ing  polynomial  approximations  to  s  from  moments  and  approximations  to 
and  for  translating  moments  is  given  in  Secton  3  below.  Section  4  contains 
symmetry  results  that  can  substantially  reduce  the  amount  of  work  required  to 
form  approximations  to  $  at  each  level.  Section  5  contains  numerical  results 
obtained  with  a  preliminary  implementation  of  the  method. 

§2.  Notation 

We  will  need  the  following  notation.  A  multi-index  a  =  (ai,...,Q;n)  is  an 
n-tuple  of  nonegative  integers.  If  x  is  an  element  of  M",  we  will  write  its 
components  as  xi,X2^ . . .  ,Xn-  We  will  also  need  sequences  of  points  in  IR". 
In  an  effort  to  make  the  meaning  of  all  subscripted  symbols  transparent,  we 
will  write  all  such  sequences  of  vectors  as  {zm},  and  2:  will  never  be  used 
unsubscripted  to  denote  a  single  point  in  IR,”. 

If  a  and  b  are  elements  of  H”,  then  we  will  say  a  is  less  than  or  equal  to  6, 
and  write  a  <  6,  if  Oj  <  bi  for  all  1  <  2  <  n.  We  also  define  for  x  G  IR”  and  a,  (3 

multi-indices  |a|  =  oti-\-ot2-\ - fan,  =  cti!a2!  •  •  ■  a;"  =  x^^x^^  •  •  • 

and  for  0  <  /?  <  a  take  (multivariate)  Binomial 

Theorem  then  assumes  the  form 

-  E 

0</3<a 

for  all  multi-indices  0  <  a  G  and  points  x^t  £  IR”.  We  further  define  the 
normalized  monomial 


Vcc{x)  =  x“/a! 


(2) 
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M 

~  ^  ^  ~  ^m)'  (3) 

m=l 

Also  define  [a,  &]  to  be  the  n  dimensional  box  {x  G  IR"  :  a  <  x  <b},  and  e  to 
be  the  n-vector  [1, 1, ,  1]^. 

§3.  Moment  Expansions 

Lemma  1  below  shows  that  we  can  form  a  polynomial  approximation  si  to 
a  spline  5  of  the  form  (1)  by  combining  moments  of  the  weights  A  with  the 
coefficients  of  a  polynomial  approximation  g  to  $  on  a  “double  width”  panel. 

Lemma  1.  Approximation  via  moments  -  correlations.  Let  c,  d  e  H”  with 
c,  d  >  0.  Let  ten"  with  |  >  Cj  +  dj  for  1  <  z  <  n.  Let  e  >  0  and  ^  be  a 
function  in  C\t  —  (c  +  d),  t  +  (c  +  d)].  Let 

q[x)  =  O'oya{x-t) 

{a:0<a<fce} 


be  a  polynomial  of  coordinate  degree  k  such  that 

11^  -  9||L°°[t-(c+d),t+(c+d)]  <  e  • 

Given  centres  zi,  Z2, . . .  zm  with  Zm  G  [— d,  d]  C  IR”  for  1  <  m  <  M,  and 
weights  Ki,  K2,  . . . ,  Km  E  IR,  let  the  corresponding  “radiaF  basis  function 

M 

5(2;)  =  ^  ^  (4) 

m=l 


be  approximated  by 


M 

Si{x)  =  ^  Kmq{x  -  Zm)  • 
m=l 


(5) 


Then 

||5-5l||L<x=[t_c,t+c]  <e||K||l  . 

(6) 

Moreover, 

Si(a?)  =  ^  ^  b/^Vpi^x  —  t), 

(7) 

{/3;0</3<fce} 

where 

(8) 

{a:/3<a<A:e} 

40 


R.  K.  Beatson  and  E.  Chacko 


Remark:  Often  in  our  applications  of  this  lemma,  will  be  truly  radial,  i.e. 
of  the  form  $(•)  =  (/►(||  •  II2),  for  some  function  (f)  of  one  variable. 

Proof:  If  -c  <  X  - 1  <  c,  then  for  1  <  m  <  M  we  have 


— (c  +  d)  <  {x  —  Zm)  —  t  ^  c  +  d. 

This  shows  all  but  the  expression  for  5i  in  terms  of  the  moments  of  the  data. 
To  see  the  latter,  write 

M 

si(a:)  =  ^  -  Zm) 

m=l 


M 


—  ^  ^  ^  ^  aQiVa{^X  Zm 

m=l  0<a<fce 

m=l  ^0<a<fce  J 

-£<-{  E  5  E 

m=l  ^0<a<fce  '  0</3<a  '  ) 


M 


=  E  1  E  ““  E 

m=l  lo<a<*:e  0</3<a 


{X  -  t)^  {-Zm) 


a-13 


P\  {a  -  D)\ 


=  E  {  E  ~ 

0<c«<l!e  lo<;3<a 


=  E  i  E  >  V>(X  -  t).  □ 

0</3<fce  I  ^<a<fce  1 


An  efficient  way  to  form  the  approximation  5i  is  to  use  real  FFTs  to 
compute  the  correlation  of  equation  (8).  Nominally,  such  a  process  involves 
three  FFTs  per  correlation.  However,  things  may  be  arranged  so  that  the 
transforms  of  moments  of  panels,  and  those  of  the  required  approximations 
to  are  computed  once  and  used  many  times.  Also  sequences  of  coefficients 
can  be  summed  in  the  transform  domain  rather  than  the  function  domain. 
This  lowers  the  average  number  of  FFTs  per  correlation  dramatically.  Thus, 
in  2  dimensions  the  work  per  correlation  is  reduced  to  approximately  + 

hk'^logk  operations,  where  h  is  small. 

A  proof  similar  to  that  of  Lemma  1  shows  the  following  total  degree 
version  of  the  approximation  via  moments  lemma. 
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Lemma  2.  Let  c^d  £  IR”  with  c,d  >  0.  Let  t  G  H”  with  jii|  >  Cj  +  bi,  1  < 
i  <  n.  Let  e  >  0  and  ^  be  a  function  in  C[t  ~  {c-\-  d),  t  +  (c  +  d)].  Let 

{a:0<a  and  |a|<k} 


be  a  polynomial  of  total  degree  k  such  that 

11^  “  9||L~[t-(c+d),<+(c+d)]  <  ^  • 

Given  centres  zi,Z2y.  ■  ■  with  Zm  €  [—d^d[  forl<m<M  and  weights 
ki,K2,.  ,  km  G  R,  let  the  corresponding  “radial”  basis  function 

M 

s(a;)  =  ^  ^  ~  Zm)} 

m=l 

be  approximated  by 

M 

Sl(ic)  =  ^Kmq{x-Zm)- 
m=l 


Then 

Moreover, 


where 


||s  -  5i||Lo=[t_c,t+c]  <  €||«||i  . 

{P’-0</3,  \0\<k} 

bp  —  5  y  (^a^O,a—P  • 

{a:P<a,\a\<k} 


The  next  lemma  shows  that  shifted  moments  can  be  expressed  as  a  convo¬ 
lution  of  moments  about  a  given  point.  This  result  will  be  used  in  generating 
the  moments  corresponding  to  a  larger  panel  of  centres  indirectly  from  the 
moments  corresponding  to  subpanels.  The  indirect  process  will  be  more  effi¬ 
cient  than  direct  formation  when  the  number  of  centres  is  large  because  the 
operation  count  for  the  indirect  shift  depends  on  the  order  of  the  moments, 
not  on  the  number  of  centres. 

Lemma  3.  Indirect  shifting  of  moments  -  convolutions.  Let  zi,..  .,zm  be  given 
points  in  K”  and  ki,  . . . ,  km  be  corresponding  weights.  Let  ay^a  be  the  a-th 
normalized  moment  of  the  data  deGned  in  equation  (3).  Then  for  all  v,u  G  R" 
and  multi-integers  a 


<^v,a 


^  ^  'w)(Tu,a— /3  • 

0</3<a 


(9) 
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Proof: 


M 

=  -  Zm) 

m=l  0</3<a 


=  V^{v  □ 

0</3<a 

In  using  Lemma  3  to  translate  moments  we  can  reduce  the  operation 
count  by  using  either  FFT  convolution  or  a  tensor  product  approach. 

We  discuss  firstly  the  tensor  product  approach.  The  tensor  product  ap¬ 
proach  may  be  viewed  as  making  a  shift  from  u  to  u  not  in  a  single  step,  but 
as  a  series  of  shifts  in  the  coordinate  directions.  For  simplicity  we  will  discuss 
only  the  2-dimensional  case. 

Consider  the  formula  (9).  lfv  —  u  =  (x,  0),  then  we  see  immediately  that 
Vj3(v  ~u)  is  nonzero  only  when  the  second  component  of  (3^  /?2,  is  zero.  Thus 
for  V  having  the  same  second  component  as  u, 

Ov.oc  =  X]  -  Wiyu,a-(/5:,0)  •  (10) 

0</5i<ai 

Considering  as  an  array  indexed  by  7  and  the  calculation  of  moments  of 
degree  not  exceeding  A:,  equation  (10)  above  implies  that  each  row  of  may 
be  calculated  in  O(fc^)  flops.  Thus  if  v  and  u  have  the  same  second  component 
all  moments  of  degree  not  exceeding  k  can  be  shifted  in  0(k^)  flops.  Similar 
remarks  apply  to  and  when  w  and  v  differ  only  in  their  second 

components.  Thus,  in  the  2-dimensional  case  this  tensor  product  strategy 
reduces  the  flop  count  for  a  single  shift  of  all  moments  of  degree  not  exceeding 
k  from  0{k^^  to  0(^k^). 

An  alternative  is  to  use  FFT  convolution  to  compute  the  transformation 
of  the  moment  shifting  lemma.  Lemma  3.  The  corresponding  operation  count 
is  0[k'^\ogk)  in  the  2-dimensional  case. 

§4.  Symmetry  and  Approximations  to  $ 

In  this  section  we  will  show  how  symmetry  considerations  can  greatly  reduce 
the  number  of  approximations  to  $  that  need  to  be  computed.  In  the  2- 
dimensional  situation,  with  a  quad  tree  subdivision  of  space,  and  without 
clumping,  there  are  40  different  geometries  of  source  and  target  for  each  level. 
The  method  requires  approximations  to  on  all  the  corresponding  double 
rectangles.  However,  for  most  choices  of  $  the  number  of  approximations 
that  need  to  be  computed  from  scratch  is  reduced  to  7  by  the  symmetry 
relations  of  Lemma  4  below.  Related  symmetry  considerations  for  the  fast 
multipole  method  are  discussed  in  Wang  and  LeSar  [7] . 
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Lemma  4.  Symmetries  and  approximations  of  “radial"  functions.  Let  Ji, 

Jn,  be  subsets  of  R  and  1  <  p  <  oo,  and  let  #  and  q  be  functions  in 
L^IJi  X  •••  X  Jn). 

•  Suppose  $  is  an  even  function  of  the  k-th  component  of  x,  Xk.  DeSne  a 
function  q  by 

q{xi,  .  .  .  j  Xk  ,  .  .  .  ,  2^n)  —  9(^1  j  •  •  •  5  ^ki  •  •  '  :  ^n)i 

for  all  {xi,. . . ,  Xn)  £  Ji  X  ...  X  -Jk  x  .. .  x  Jn.  Then 

11^  -  Q\\LP{JiX-X-JkX:-XJn)  =  li^  ~  ^\\LP{JiX.-XJkX-XJn)-  (l^) 

•  Suppose  ^(y)  is  unchanged  by  permutation  of  the  components  yi,...,yn, 
of  y.  Let  It  be  any  permutation  of  the  integers  {1, 2, . . .  ,n},  let  be 
its  inverse,  and  dehne  q  by 

q{yi,  .  •  .  ,  2/n)  =  g(2/7r-i(l)’  •  ■  •  »  Vir-Hn)),  (13) 

for  all  y  €  ^^(i)  x  •  •  •  x  J^^^^n).  Then 

11^  -9||lp(^,(1)X-XJ,(„))  =  -  q\\LP(J^X-XJn)-  (14) 

Proof:  Let  q  and  q  be  as  in  the  statement  of  the  first  part  of  the  lemma. 
Let  X  =  {xi,"  ■  ,Xk,  -  ■  ■  Xn)  be  a  point  in  x  . . .  x  -Jk  x  . . .  x  Jn.  Using  the 
evenness  of  $  in  the  fc-th  component  of  x, 

l^(xi,..,,Xk>...,Xn)  -  q(xu...,Xk,...,Xn)l 

=  |$(xi ,  .  .  .  ,  Xk)  .  .  .  )  Xn)  q(xi)  .  •  '  )  Xk)  .  .  .  )  Xn)\ 

where  (a;i,  •  •  • ,  w,  •  •  • , x„)  is  a  point  in  x  •  •  •  x  Jn-  The  first  part  of  the 
lemma  follows  by  using  this  equality  in  the  appropriate  power  integrals 
and  essential  supremum. 

We  turn  now  to  the  second  part  of  the  lemma.  Let  q,  and  q  be  as 
in  the  statement  of  that  part  of  the  lemma.  Let  y  =  (yi, . . . ,  ?/„)  be  a  point 
in  Jr(i)  X  •  •  •  X  J7r(n)-  Using  that  #(y)  is  unchanged  by  permutations  of  the 
components  of  y,  and  defining  x  =  {xi,  ...,Xn)  =  (y7r-i(i)>  •  •  • ,  2/7r-i(n))5 

|<^(2/l,---,2/n)  -g(yi,---,yn)|  =  |^(2/l,---,2/n)  -^(2/7r-i(l),'--,y7r-i(n))| 

=  |$(a;i,...,x„)-g(a;i,...,a;„)|, 

where  (a^i, . . . ,  ain)  is  a  point  in  Ji  X  •  •  •  x  J„.  The  result  follows  by  using  this 
equality  in  the  appropriate  p^^  power  integrals  and  essential  supremum.  □ 

Remark:  Suppose  $(•)  =  (^(j|  •  [|p)  for  some  function  of  one  variable  (f>  and 
some  p-norm  for  IR”,  1  <  p  <  oo.  Then  $  is  even  in  all  the  components  of  x, 
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Fig.  1.  Symmetry  in  the  2D  setting. 

and  is  also  unchanged  by  permutations  of  the  components  of  x.  Hence,  this 
lemma  applies  to  all  such  generalized  radial  functions  and  allows  us  to  use 
symmetry  to  obtain  approximations  to  $  on  new  regions  from  those  on  old. 

Figure  1  shows  the  geometry  of  source  panels  to  target  panels  in  IR^ 
when  we  use  a  quad  tree  subdivision  of  without  clumping.  The  solid 
black  square  is  the  target,  or  evaluation  panel,  and  the  possible  source  panels 
are  numbered  0  through  39.  Actually,  the  sources  in  the  left-most  column 
and  bottom-most  row  would  not  be  used  for  the  illustrated  position  of  the 
evaluation  panel  within  its  parent.  However,  they  would  be  used  for  different 
positions  of  the  target.  If  the  source  is  [—d,  d]  and  the  target  is  [i  —  c,  ^+c],  then 
Lemma  1  requires  an  approximation  g  to  on  the  “double  width”  rectangle 
[t  —  (c  +  d),  t  +  (c  +  d)].  If  $  has  all  the  symmetries  of  Lemma  4,  then  at  each 
level  only  the  seven  approximations  corresponding  to  source  panels  0, ...  ,6 
need  be  calculated  directly.  The  33  other  approximations  are  easily  obtained 
by  symmetry.  The  relevant  symmetries  to  use  on  a  previously  calculated 
approximation  are  indicated  in  parentheses  in  panels  7  to  39.  For  example, 
the  notation  (2x)  in  source  panel  7  indicates  that  the  approximation  for  source 
panel  7  is  obtained  from  that  for  source  panel  2  by  symmetry  in  a;.  In  the 
function  domain  this  corresponds  to  negating  the  coefficients  of  odd  powers  of 
X.  In  the  Fourier  domain  it  corresponds  to  a  block  rearrangement  of  columns 
or  rows,  depending  on  which  correspond  to  x.  Similarly,  the  notation  (li) 
in  source  panel  10  indicates  that  the  approximation  for  source  panel  10  can 
be  obtained  from  that  for  source  panel  1  by  symmetry  in  x  and  y.  This 
corresponds  to  a  transpose  operation  on  the  coefficients  in  both  the  function 
and  Fourier  domains. 
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§5.  Numerical  Results 

Some  numerical  results  from  a  primitive  implementation  of  the  algorithm  are 
given  in  Tables  1-4.  In  this  implementation  the  core  tensor  product  polyno¬ 
mial  approximations  9  to  employed  as  in  Lemma  1,  are  formed  by  interpo¬ 
lation  at  shifted  and  scaled  Chebyshev  nodes  followed  by  economisation.  The 
moments,  and  the  coefficients  of  the  approximations  to  typically  have  a 
wide  dynamic  range  when  the  polynomial  degree  is  15  or  more.  Consequently, 
some  extra  device  is  needed  in  order  to  make  the  algorithm  stable,  especially 
when  the  FFT  is  used.  In  the  current  code  the  method  used  is  a  scaling  of 
moments  and  polynomial  coefficients  analogous  to  that  suggested  by  Green- 
gard  and  Rokhlin  [6]  for  the  2-D  fast  multipole  method.  The  device  may  be 
viewed  as  scaling  every  panel  at  every  level  to  be  [-2, 2]^. 

In  the  calculations  reported,  the  centres  are  approximately  uniformly 
distributed  on  [0,1]^.  If  the  number  of  centres  is  N,  then  in  Table  1  the 
number  of  evaluation  points  is  the  smallest  perfect  square  bigger  than  iV,  and 
in  Table  2  the  number  of  evaluation  points  is  the  smallest  perfect  square  bigger 
than  IQN.  In  both  cases  ^([a:,?/])  =  +x‘^  where  c  =  1/VN,  and 

all  the  coefficients  of  the  spline  (4)  are  1.  The  piecewise  tensor  product 
bivariate  polynomials  used  are  of  coordinate  degree  7. 


Tab.  1.  Moment  based  method  versus  direct  evaluation. 


#of 

centres 

Ord.  Alg. 
time 

FFT  Alg. 
time 

Direct 

time 

Ratio 

Abs. 

error 

Rel. 

error 

4000 

0.41 

0.27 

3.57 

13.16 

3.99E-04 

1.30E-07 

8000 

0.84 

0.58 

14.90 

25.51 

2.42E-03 

3.95E-07 

16000 

1.78 

1.23 

59.98 

48.82 

1.71E-03 

1.40E-07 

32000 

3.64 

2.65 

237.47 

89.75 

9.30E-03 

3.79E-07 

Tab.  2.  Moment  based  method  versus  direct  evaluation. 


#of 

centres 

Ord.  Alg. 
time 

FFT  Alg. 
time 

Direct 

time 

Ratio 

Abs. 

error 

Rel. 

error 

4000 

1.37 

0.97 

35.14 

36.28 

5.49E-04 

1.79E-07 

8000 

2.88 

2.00 

148.98 

74.49 

4.39E-03 

7.16E-07 

16000 

5.87 

4.17 

593.40 

142.23 

2.07E-03 

1.69E-07 

32000 

12.51 

8.98 

2356.92 

262.33 

1.82E-02 

7.43E-07 

The  timings  in  the  tables  are  in  seconds  on  an  Intel  pentium  based  ma¬ 
chine.  Timings  are  given  for  direct  evaluation  and  for  the  algorithm  both  with 
and  without  the  speed  benefits  of  FFT  convolutions  and  correlations.  Tables  3 
and  4  repeat  the  runs  with  polynomials  of  coordinate  degree  15. 
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Tab.  3.  Moment  based  method  versus  direct  evaluation. 


#of 

centres 

Ord.  Alg. 
time 

FFT  Alg. 
time 

Direct 

time 

Ratio 

Abs. 

error 

Rel. 

error 

4000 

1.19 

0.81 

3.57 

4.42 

3.47E-08 

1.13E-11 

8000 

2.97 

1.70 

14.90 

8.74 

1.74E-07 

2.84E-11 

16000 

5.93 

3.55 

59.98 

16.88 

1.35E-07 

l.lOE-11 

32000 

13.19 

7.74 

237.54 

30.69 

6.17E-07 

2.52E-11 

Tab.  4.  Moment  based  method  versus  direct  evaluation. 


#of 

centres 

Ord.  Alg. 
time 

FFT  Alg. 
time 

Direct 

time 

Ratio 

Abs. 

error 

Rel. 

error 

4000 

4.20 

2.19 

35.14 

16.06 

2.49E-08 

8.11E-12 

8000 

9.32 

4.51 

148.37 

32.89 

1.09E-07 

1.77E-11 

16000 

19.16 

9.32 

593.40 

63.65 

8.48E-08 

6.91E-12 

32000 

39.70 

19.52 

2356.92 

120.76 

3.86E-07 

1.57E-11 
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Poly  harmonic  Splines  in  R"*: 
Tools  for  Fast  Evaluation 


R.  K.  Beatson,  J.  B.  Cherrie,  and  David  L.  Ragozin 


§1.  Introduction 

As  is  now  well  known,  hierarchical  and  fast  multipole-like  methods  can  greatly 
reduce  the  storage  and  operation  counts  for  fitting  and  evaluating  radial  basis 
functions.  In  particular,  for  spline  functions  of  the  form 

N 

s(a;)  =  p(x)  +  ^  Xi(l){\x  -  a:i|),  (1) 

i=l 

p  a  low  degree  polynomial,  the  cost  of  a  single  extra  evaluation  can  be  reduced 
from  0{N)  to  0(1)  operations,  and  the  cost  of  a  matrix-vector  product  (that 
is,  evaluation  at  all  centers)  can  be  decreased  from  0{N‘^)  to  0{N). 

This  paper  outlines  some  of  the  mathematics  required  to  implement  meth¬ 
ods  of  these  types  for  polyharmonic  splines  in  d  even,  that  is  for  splines  s 
corresponding  to  (f)  chosen  from  the  list 


0^(r)  = 


^2(£+l-d/2)^  ^  =  0,...,d/2-2, 

r2(^+i“‘^/2)  log(r),  ^  =  d/2-1,...  . 


(2) 


We  carry  out  most  of  our  work  in  the  general  setting  and  then  specialize 
to  d  =  4.  We  refer  the  reader  to  our  more  detailed  work  [1]  which  contains 
all  the  details  of  this  special  case.  We  are  currently  working  on  developing  all 
the  details  for  the  general  case. 

A  key  technique  in  our  development  is  the  exploitation  of  the  rotation 
group  invariance  of  radial  basis  functions.  This  means  that  we  exploit  the  fact 
that  any  kernel  k{x,  y)  =  ^(|a:  —  ^|)  will  be  rotation  invariant  in  the  sense  that 

k{gx,gy)  =  k{x,y),  for  all  orthogonal  g  E  0(d).  (3) 


Invariance  leads  to  many  crucial  simplifications  and  efficiencies  in  developing 
and  manipulating  the  polyharmonic  expansions  which  lie  at  the  heart  of  the 
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hierarchical  and  fast  multipole  expansions.  Related  development  of  general 
spherical  harmonic  expansions  based  on  these  techniques  can  be  found  in  [6, 
7,8]. 

We  will  not  detail  the  basic  framework  of  hierarchical  and  fast  multipole 
methods  within  which  this  mathematics  sits.  However,  we  do  recall  that 
an  essential  component  of  the  method  is  the  grouping  of  approximations  to 
summands  like  (1)  into  subsums,  which  are  approximations  to  the  influence 
of  that  part  of  (1)  associated  with  centers  in  a  single  panel  or  cluster.  The 
key  steps  to  obtain  them  requires: 

•  Finding  explicit  Taylor/Laurent  expansions 

For  each  x,  (j)(\x-x<\)=  ^(x,x<),  |x<;|  <  |x|,  (4) 

with  ^  homogeneous  polynomials  of  degree  m  in 

•  Finding  an  efficient  separation  of  the  x,  x<  influence  in  i.e.,  expanding 

p;;(x,x<)  =  x;/rwsr'(^<).  (5) 


for  some  good  choice  of  (basis)  functions  {fi^{x)}  and  {5r(^)}- 

These  expansion  and  separation  results  provide  the  approximations  to  sub¬ 
sums  which  are  the  far  and  near  field  expansions.  Other  essential  components 
are  the  tools  to  manipulate  these  expansions,  namely  error  estimates,  unique¬ 
ness  theorems,  and  translation  formulae.  In  this  paper  we  concentrate  on  the 
algebraic  tools  and  give  some  extensive  general  results  on  (4)  (Theorem  6) 
and  (5)  ((20)  in  Section  3)  and  for  we  give  the  appropriate  far  and  near 
field  expansions  for  the  (pi  (Theorems  7  and  8),  and  a  brief  indication  of  the 
dual  basis  leading  to  (20).  Analogous  results  for  polyharmonic  splines  in  IR^ 
appear  in  [3].  The  reader  unfamiliar  with  the  framework  of  the  fast  multipole 
method  may  wish  to  refer  to  the  original  paper  of  Greengard  and  Rokhlin  [4], 
or  to  the  introductory  short  course  [2]. 

§2.  Polyharmonic  Functions  and  Homogeneous  Polynomials 

First  we  record  some  detailed  facts  relating  to  the  Laplacian  A,  and  its  actions 
on  special  homogeneous  functions  and  the  logarithm  of  the  distance.  In  par¬ 
ticular,  we  show  why  our  basic  functions  cpi  in  (2)  are  polyharmonlc  or  more 
specifically  (£  +  l)-harmonic  in  the  sense  that  =  0- 

Lemma  1.  Let  |  •  |  be  the  2-norm  on  d  even. 

i)  If  f  :  IR^  \  {0}  IR  is  a  non-trivial  harmonic  function  that  is  homoge¬ 
neous  of  integral  degree  m,  then 


A(|  •  pV)  =  2^ (d  +  2^  +  2m  -  2)  I  • 


(6) 
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Hence  |  •  is  polyharmonic  of  exact  order 

ff+1,  for  i  >  0,m  > or  m  <  1  -  f 

\£  +  m+|,  for  £  <  O^m  >  1  -  f  — 


In  particular 


\2£ 


{(£  +  1) -harmonic 
(£  -f-  d/ 2) -harmonic 


ii) 


for  £>0, 
for  -d/2  <  £  <0, 

1^^  log  I  ■  I  is  (£  -f-  d/2)-harmonic  for  £>0,  More  generally, 

riogi 


A| 


=  I .  |2(^-i)(2£(2^  -f  d  -  2)  log  I  ■  I  -f  (4^  -f  d  -  2)). 


(7) 


Proof:  For  the  first  part  of  (i),  just  apply  the  product  rule  for  the  Laplacian, 


A(fg)  =  iAf)g  -h  2(V/)  •  (Vg)  +  f{Ag), 


and  the  Euler  relation  for  a  function  /  that  is  homogeneous  of  degree  m, 


X  •  (V/)(x)  =  mf{x). 

Then  observe  how  many  applications  of  A  are  required  to  reduce  one  of  the 
multipliers  to  0.  Specializing  to  the  case  /  =  1  yields  the  last  result  of  (i). 

The  first  part  of  (ii)  follows  from  (7)  in  combination  with  (i)  and  its  proof. 
(7)  follows  from  (i),  the  product  rule  for  the  Laplacian,  and  the  computation 
of  V  log  I  •  I  and  A  log  [  •  |  .  □ 

From  the  detailed  eigenvalue-like  information  on  the  Laplacian  map  in 
(6),  we  can  get  a  decomposition  theorem  for  n„,  the  homogeneous  polynomials 
of  degree  n,  in  terms  of  the  spherical  harmonics  of  degree  n: 

=  {p  G  n„  :  Ap  =  0}  =  ker(A)  f!  n„. 


This  is  useful  in  understanding  the  structure  of  the  homogeneous  polynomial 
terms  in  any  Taylor/Laurent  type  series  expansions  for  the  (l>{\x  -  Xi\).  In 
view  of  Lemma  1  the  decomposition  splits  n„  into  its  harmonic,  biharmonic, 
triharmonic,  etc.,  parts. 

Lemma  2.  n„  =  |  •  P^lHn-2^-  In  particular,  ]H„  D  |  •  |^n„_2  =  {0}. 

Proof:  Note  that  for  n  =  0, 1,  Iln  =  Hn,  so  the  base  for  an  inductive  proof 
is  true.  Assume  the  decomposition  for  n  —  2,  some  n  >  2,  so  for  each  p  G  n„, 


L(n-2)/2j 

^P=  XI  I  '  l^^^n-2-2£,  some  hi  G  Mn-2~2e’ 

i=0 


Then  by  (6),  if 

L(n-2)/2j 

h„=p-  -(£  +  l)-\d  +  2n-2{e  +  2)r^|■^^(‘+^h„_2(^+l), 

£=0 

then  Ahn  =  0.  So  G  ]H„  and  the  decomposition  of  !!„  is  proved  by  induc¬ 
tion.  □ 
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Some  additional  consequences  of  (6)  come  when  we  study  what  happens 
for  negative  Here  we  have  noted  that  |  •  j2-2m-dy  jg  harmonic  whenever  / 
is  m-homogeneous  and  harmonic.  But  bringing  the  factor  of  (|  •  |”2^m  jjjgjfjg 
the  m-homogeneous  function  /  shows  |  ■  P~'^/(7I  '  1^)  is  harmonic  for  any 
homogeneous  harmonic  function  /.  In  fact  this  construction  is  independent 
of  the  homogeneity  order  of  /.  In  general  the  Kelvin  transform,  defined  by 
Kf(x)  =  f{x/\x\'^),  which  arises  from  inversion  in  the  sphere  followed 

by  multiplication  by  \x\^~^^  maps  harmonic  functions  to  harmonic  functions. 
On  n„  Kf  =  I  •  |2-ci-2nj  Associated  with  the  Kelvin  transform  are  the  spaces 
of  negative  degree  2-  d-m 

OO  CX3 

0  I  •  =  0  I .  (8) 

^=0  £=0 

which  are  useful  for  analysing  Laurent  (far  field)  series.  An  application  of 
Lemma  1  shows  that  Equation  (8)  displays  the  space  under  consideration 
split  into  its  harmonic,  biharmonic,  triharmonic,  etc.,  parts. 

§3.  Rotation  Invariance  and  Simplified  Taylor  Expansions 

The  decompositions  of  polynomial  spaces  in  the  previous  section  already  sim¬ 
plify  the  Taylor/Laurent  type  expansions  (4)  we  need  to  determine.  To  make 
further  progress,  we  want  to  exploit  the  rotation  invariance  of  </)(|x  -  x,|). 
When  we  come  to  combine  subsums  in  (1),  we  will  want  to  fix  x  and  concen¬ 
trate  on  rotations  (orthogonal  matrices)  which  fix  x.  When  we  are  given  a  pole 
p  G  we  let  Gp  =  {g  :  g  ^  0{d),  gp  =  p}  denote  the  rotations  about  the  ray 
through  p.  So  the  function  f^{x^)  =  (f){\x  -  2;<|)  satisfies  f^{gx^)  =  f^{x^)^ 
for  all  g  €  Gx-  We  refer  to  any  function  /  which  is  unchanged  by  rotations  in 
Gp  as  a  p-zonal  function.  In  particular  we  have  the  p-zonal  harmonics 

=  {/i  G  1H„  :  h{gy)  =  h{y)  for  all  g  G  Gp},  (9) 

and  the  p-zonal  homogeneous  polynomials  11^.  Now  the  Taylor/ Laurent  ex¬ 
pansion  of  as  in  (4),  will  have  j^{x,gx^)  =  ^{gx,gx^)  =  ^(x,x<),  for 
g  ^  Gx  since  the  homogeneous  terms  must  remain  unchanged  under  rotations 
(see  Theorem  6).  Thus  these  terms  will  be  x-zonal  polynomials  as  a  function 
of  x^.  What  is  the  general  stucture  of  and  H^? 

Theorem  3.  Fix  a  pole  x  G  ]R‘^\{0}  .  Let  Xo(-)  =  1,  Xi(')  -  2{j:,-).  Then 
there  exist  a  unique  set  of  constants  Om,  m  >  1,  such  that  the  inductively 
defined  sequence  of  homogeneous  polynomials, 

Xm+l  XlXm  ~  I  ‘  I  Xm— 1>  ^  ^  1>  (10) 

consists  of  harmonic  functions.  Moreover, 
i)  Xm  m-homogeneous  x-zonal  harmonic  function,  which  is  rotation 

invariant  in  the  sense  that  Xmid')  =  Xm(')  9  ^  0(d). 
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Xm(®<)  =  W,  0  ^  X,  a:<  €  IR'', 


(11) 


and  the  Xm(^<)  homogeneous  and  harmonic  as  functions  of  x. 

iii)  Ifx^O,{\’  ^  =  0,...,  [m/2j},  form  a  basis  for  In 

particular,  Xj  is  the  unique  (up  to  a  scalar  multiple)  element  ofMj. 

iv)  For  m,  n  nonnegative  integers,  the  kernels 


krn,£,n{^j  ^<)  — 


_!„l2^+m-K 


|^^l2£+n  ^Xk-2£(^<)^  K  =  min(m,n), 


m  =  n  mod  2,  £  =  0, . . . ,  Lk/2J  , 


(12) 


form  a  basis  for  the  space  of  all  rotation  invariant  polynomial  kernels, 
Pm,n{^,  x<)  which  are  homogeneous  of  degree  m  in  x,  and  degree  n  in  x< . 

Proof;  The  proof  of  the  existence  of  am  and  (i),  (ii),  and  (iii)  is  by  induction. 
For  m  =  0  (i)  and  (iii)  are  trivially  true.  Let  0  ^  h  e  Hi.  Then  h  has  the 
form  h{-)  =  c{p,-).  Since  h  is  rc-zonal  c(p,  •)  =  c{p,g-)  =  c{g~^p,-),  for  all 
g  e  Gp.  This  implies  p  has  the  same  direction  as  x.  Hence  h  is  a  multiple  of 
Xi  and  (i)  and  (iii)  follow  for  m  =  1. 

Now  induction  shows  that  (10)  defines  m-homogeneous  a)-zonal  polyno¬ 
mials  which  are  m-homogeneous  in  x,  for  any  choice  of  Um+i.  Also  they  are 
rotation  invariant.  To  complete  the  inductive  step  for  (i)  with  a  fixed  x  we 
need  only  show  that  there  is  a  unique  a^+i  that  makes  Xm+i(*)  harmonic. 

From  the  homogeneity  in  x,  we  may  assume  |a;|  =  1.  Since  xfXm 
a  homogenous  polynomial  of  degree  m  -1-  1,  Lemma  2  asserts  that  there  ex¬ 
ist  unique  homogenous  harmonic  polynomials  qm-\-i--2£  such  that  XiXm  = 
j^L^+i)/2j  I  _  =  2x'^,  the  product  rule  for  the  Lapla- 

cian  and  the  inductive  assumption  that  Xm  harmonic  show  that 

A(xTx^)  =  •  VxS.  =  4a.x^,  =  1,  (13) 

where  dx  denotes  the  directional  derivative  in  the  (fixed)  direction  x.  Since 
A^xXm  =  =  0)  it  follows  that  XiXm  is  bi-harmonic  and 

A(xTXm)  =  A(|  •  =  2(d  +  2(m  - 

Since  A  maps  cc-zonal  functions  to  a;-zonal  functions  it  follows  that  Qm-i  ^ 
H^_i  and  therefore  by  part  (iii)  of  the  inductive  hypothesis  Qm-i  is  a  multiple 
of  Xm-i*  Thus  the  existence  and  uniqueness  of  Om+i  making  Xm+i  harmonic 
is  proved. 

We  now  turn  to  the  inductive  step  in  the  proof  of  (ii).  Using  the  rotation 
invariance  part  of  the  inductive  hypothesis,  Xm+ii9~^')  ^s 

Xf(s"'-)Xm(s"'')  -  a„+i|xHg-'  •  = 


(14) 
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Since  rotations  and  A  commute,  the  left-hand  side  of  the  above  is  harmonic. 
Thus  the  right-hand  side  of  (14)  equals  Xm-fi  f^m+i  is  independent  of  g. 
Using  homogeneity  it  is  also  independent  of  |x|.  Hence  am+i  is  independent  of 
X.  The  symmetry  in  x^  x^  of  (10)  then  implies  (11),  and  hence  the  homogeneity 
and  harmonicity  of  X^-|-i(x<)  as  a  function  of  x. 

We  now  turn  to  the  inductive  step  in  the  proof  of  (iii).  Since  11^  =  Hm 
and  xT  —  we  may  assume  |2;|  =  1.  It  suffices  to  show  that  dimH^^i  < 
Lm/2J  1  since,  by  Lemma  1,  {|  •  ^  L^/2J}  is  an  inde¬ 
pendent  set  in  Since  we  can  rotate  ei  to  x  by  some  orthogonal  map, 

which  will  isomorphically  map  H^^i  to  H^^j,  we  prove  our  dimensionality 
statement  for  x  =  ei.  To  analyze  the  value  f(y)  of  any  ei-zonal  function 
/,  we  choose  orthogonal  g±  G  Gei  which  transform  y  into  the  coordinate 
plane  spanned  by  the  first  two  basis  vectors.  The  two  possible  values  for  the 
transformed  y  are 

9±y  =  (2/,ei)ei  ± 

Then  f{y)  =  f{g±y)  -  /(2/1,  0, . . .  ,0).  In  particular,  /  must  be 

even  in  its  second  variable.  If  /  G  H^^j, 

L(Tn+l)/2j  ,  - .  2^ 

/(?/)=  Y1  >  for  some  c^. 

Hence  the  functions  {\y\'^  —  ylY  span  H^^j. 

For  (iv)  we  just  note  that  Pm,n(a:,  •)  =  |a^|”^Pm,n(2^/|2^|j  ■)  ^  by  the 

homogeneity  assumptions.  Thus  the  basis  facts  from  (iii)  imply  there  are 
functions  hn^i{xl\x\)  with 

Ln/2J 

t-Q 

The  rotation  invariance  of  Pm,n  and  of  the  terms  implies 

^n,t{9^l\g^\)  —  ^n,t(^l\A)  for  all  rotations  g.  Rotating  xl\x\  to  e\  shows 
^n,(.{xl\x\)  =  hn,i{ei)^  2.e.,  the  hn,i  are  constants.  Moreover,  the  homogeneity 
in  X  of  xj  shows 

Ln/2J 

=  Y  (15) 

Since  the  left  hand  side  is  a  polynomial  of  degree  m  in  x,  and  the  |  ■  Y^Xn-21 
are  independent,  each  Xn-2t  associated  with  a  nonzero  coefficient 

must  be  a  polynomial  of  degree  m  in  x.  Hence,  n  —  m  =  2j  must  be  even. 
Also,  applying  the  second  part  of  Lemma  2,  2(t  -  j)  —  —  (n  -  m)  >  {).  If 

m>n  the  proof  of  (iv)  is  done.  If  m  <  n,  then  reindexing  the  sum  in  terms 
of  —  j)  yields  (12).  □ 

The  following  result  is  known  [5],  but  is  included  for  the  sake  of  com¬ 
pleteness. 
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Theorem  4.  DeSne  a  rotation  invariant  inner  product  (pairing)  for  functions 
on  the  unit  ball  B  C  by 

[f:  h]  =  cv  f  f{y)h{y)dy,  Cy^  =  vol{\y\  <  1}.  (16) 

i)  If  f,  h  are  homogeneous  of  degrees  m,n,  respectively  with  fh^ 

then  [/,  h]  =  0  if  and  only  if  f(y)Ky)<^^  =  integral  of 

fh  over  the  sphere  S‘^~^  is  zero. 

ii)  Ifm^n,  then  \  ■  and  |  •  are  orthogonal  with  respect  to  this 

inner  product  (pairing),  provided  m n  +  2(i  +  j)  >  -d. 

Proof;  For  (i)  just  introduce  polar  coordinates  (r,y)  E  [0,1]  x  5^^“^  Then, 
by  homogeneity  and  scaling  properties  of  the  area  of  {a; :  |a;|  =  r}, 


[/,  h]  =  cv  t  f  f(y)h(y)dAdr,  m  +  n  +  d  >  0, 

Jr=0  J 

and  the  result  follows.  By  (i)  it  suffices  to  prove  (ii)  when  i  =  j  =  0,  since 
integrals  of  a  product  |  •  on  do  not  depend  on  i,j.  Let  /  G 

and  h  E  1H„.  Then  by  Green’s  Theorem  and  the  Euler  relation  for 
homogeneous  functions 


0  = 


/(y)Aft(y)  -  %)A/(j/))dy 
/(v)VMy)  -  %)V/(y))  ndA 


L 


{|yNi} 


(n  -  m)f{y)h{y)dA. 


Thus  f{y)Ky)^^  -  0:  analogous  relation  holds  for  integration 

over  the  ball  by  part  (i).  □ 

An  application  of  the  above  gives 

Lemma  5.  The  constants  am,  m>  2,  in  the  3-term  recurrence  (10)  dehning 
the  x-zonal  harmonics  Xm  Theorem  3  are  positive. 

Proof;  By  Theorem  4  and  (10), 


0  —  [Xm+l>Xm-ll  ~  [XiXttu  Xm-l]  a^n+lil  *  |  Xm-ljXm-l] 
=  [Xm,XlXm-ll  -"m+lll  •  PXm_l,Xm-l] 

~  [Xm>  Xm  d”  ‘  !  Xm— 2]  “  •  |  Xm— Ij  Xm-ll 

[Xm J  Xm]  [|  '  |  Xm— iJ  Xm— ll" 


Hence,  Om^l  =  [XmiXm]/[|  *  PXm-l>Xm-l]  >  0.  □ 

Now  part  (iii)  of  Theorem  3  leads  quite  directly  to  the  structure  of  near 
and  far  field  expansions  of  general  rotation  invariant  kernels  ^(a;,a;<).  The 
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heuristic  that  a  far  field  expansion  of  '0(x,x<)  with  respect  to  x  can  be  found 
from  a  Taylor  expansion  with  respect  to  x<,  has  been  known  to  us  for  some 
while.  Theorem  6  below  gives  a  proof  that  the  underlying  idea  is  correct 
inimportant  special  cases.  In  fact,  we  have  the  following  result  for  such  ■0 
which  are  jointly  homogeneous  (■0(aa:,  aa;<)  =  x<))  for  some  even 

integral  power  and  are  analytic  about  =  0,  such  as 

i)n(x,x^)  =  |x  -a:<p”(log(|a:  -  x<p)  -  log(|a:|2)).  (17) 

Theorem  6.  Let  x,x^  G  IR^  be  rotation  invariant,  jointly  homo¬ 

geneous  of  degree  2n  and  analytic  in  x,x^,  for  |a;<|  <  |x|.  Then  there  exist 
constants  c^  ^  such  that  the  Taylor  expansion  of  ip  about  x<  =  0  has  the  form 

oo 

t(x,xj='£  E  (18) 

m=0  £=0 
oo  Li^/2J 

=  E  E  (19) 

m=0  ^:=0 

When  xjj  is  {k  +  l)-harmonic  in  x^,  the  upper  limit  on  £  in  (18)  or  (19)  is 
min{A:,  [m/2j}.  Jf '0(a;,  0)  =  0  then  the  lower  limit  on  m  in  (18)  or  (19)  is  1. 

Proof:  The  terms  pjji{x,x^)  in  (4),  the  Taylor  series  of  ‘ip{x,  a:<)  with  respect 
to  a;<,  are  degree  m  homogeneous  polynomials  in  a:<.  When  any  Taylor  se¬ 
ries  is  grouped  by  homogeneity  with  respect  to  a:<,  each  group  is  uniquely 
determined.  Since  only  the  term  ^{gx,gx^)  in  the  series  for  'tp{gx,gx^)  has 
homogeneity  m  in  the  rotation  invariance  implies  that  ^  is  also  rota¬ 
tion  invariant.  Similarly  the  joint  homogeneity  of  V’  yields  ^[ax,ax^)  = 
o^^Pm{x^x<).  Since  for  any  x,x^  there  is  a  rotation  g  (or  reflection  if  d  =  2) 
which  interchanges  the  rays  through  x  and  i.e.,  g{xl\x\)  =  (a:</|a:<|)  and 
g{x^l\x<^\)  =  (a7/|(c|),  it  follows  that 

=  ^^<1  (^)  "  \x<?^‘^~'"^P^{x^,x). 

Since  the  final  right  side  in  this  string  of  equalities  is  an  m-homogeneous 
polynomial  in  x,  we  see  that  the  terms  in  (4)  have  the  form  x^ ) 

with  Pm  a  rotation  invariant  m-homogeneous  polynomial  in  each  of  x,x^. 
Hence  (18)  follows  by  Theorem  3.(iii).  □ 

The  separation  properties  in  (5)  can  now  be  achieved  by  further  use  of 
rotation  invariance.  Each  of  the  subspaces  |  ■  j  -\-2£  =  n,  which  occur 

in  the  decomposition  of  Hn  is  rotation  invariant.  Hence  it  has  a  (unique) 
rotation  invariant  reproducing  kernel 

dim  Hj 

x?‘\y\‘^‘  E 

i=0 


k(x,y) 


(20) 
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where  {[  •  P^/i}  and  {|  •  are  any  bases  for  this  subspace  which  are  bi- 

orthogonally  dual  with  respect  to  some  rotation  invariant  inner  product,  e.g. 
the  inner  product  (16)  (see  [8].)  But  by  (12)  in  Theorem  3.(iii),  since  k{x,y) 
is  (exactly)  {£  +  l)-harmonic  as  a  function  of  y,  and  is  homogeneous  of  degree 
2i  +  j  in  both  x,y, 

k(x,y)  =  Cj^e\xf%\‘^^Xj{y)^ 

for  some  cj^i  >  0.  Equating  (20)  and  (21)  provides  separation  of  the  influence 
of  X,  x<  in  the  expression  for  x|(^<)-  A  consequence  is  separation  of  x,  x< 
in  the  far  and  near  field  expansions  given  by  Theorem  6,  thus  allowing  the 
combination  of  the  expansions  for  several  centers  x<  =  Xi^i  =  1, . . . ,  A^,  into 
one  expansion  about  0. 


§4.  Expansions  in 

In  this  section  we  use  the  results  of  Section  3  in  the  case  to  outline  the 
explicit  expansion  formulae  for  the  4>i  in  (2)  for  d  =  4.  We  start  with  the  far 
field  expansion  of  the  potential  function  |x  -  x<|~^. 

Theorem  7.  For  x,x<  e  with  |x<|  <  |x|, 

lx  -  x<|-2  =  f;  (x),  c„  =  1.  (22) 

m=0 

Proof:  Since  \x  —  x<|“^  is  harmonic  in  R'^,  an  expansion  of  this  form  holds 
for  some  constants  c„^  by  Theorem  6.  Using  (x)  =  Xm(^<)j  multiplication 
by  |x  -  x<p  =  |xl^  +  |x<p  -  xT(2^<)  yields 

oo 

m=0 

+  (Cm~2  -  Cm-iam)\x\~'^^^~'^'>\x^\'^Xm-2i^<)^ 

when  the  recurrence  (10)  is  used  and  the  geometrically  convergent  expansion 
is  rearranged  to  group  terms  of  common  homogeneity  in  x<.  Then  equating 
coefficients  using  (iii)  of  Theorem  3  shows  cq  =  1,  Cm  =  and  Cm-2  = 
Cm-io>Tn-  These  must  be  consistent  so  =  1  for  all  m.  □ 

We  now  outline  the  expansion  of  '0„(x,x<)  from  (17).  This  gives  us  the 
bulk  of  the  far  field  expansion  for  (^n+2- 

Theorem  8. 

oo  min{Ti+l,Lm/2j} 

i,„(x,x<)=Y  E  (23) 

m=l  £=0 

where  the  non-zero  coefficients  c!^  ^  are  given  by  the  formulae  ^ 
and  the  recurrence  =  c^^^  -  -  c”  +  cl^_2,t-v 

Proof:  The  form  of  all  the  expansions  follow  from  (18),  since  '^„(x,0)  =  0. 
The  explicit  determination  of  the  £,  the  n  =  0  case,  is  done  in  Lemma  4.4 
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of  [1].  The  recurrence  for  the  follows  as  in  Theorem  7  from  (10)  with 
=  1  upon  multiplication  of  the  V>n  Ccise  by  \x  -  The  details  are  in 

Lemma  4.6  of  [1].  □ 

The  explicit  construction  of  bases  for  (and  dual  bases)  which  are 
needed  for  the  separation  results  can  also  be  significantly  simplified  by  use  of 
the  rotation  invariance  perspective.  A  detailed  development  in  is  in  our 
previously  cited  work. 
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Constructive  Approximation  by 
(V,f)-Reproducing  Kernels 

Mohammed-Najib  Benbourhim 


Abstract.  In  this  paper  we  propose  a  constructive  method  to  build 
reproducing  kernels.  We  define  the  notion  of  fV,  /Vreproducing  kernel, 
and  prove  that  every  reproducing  kernel  is  a  fF, /^reproducing  kernel. 
We  study  the  minimal  approximation  by  these  (V,  /)-reproducing  kernels 
for  different  choices  of  V  and  /.  Examples  to  which  our  results  apply 
include  curve  and  surface  fitting. 


§1.  (V,f)-Reproducing  Kernels 

For  any  set  (respectively  locally  compact  set)  we  denote  by  (respectively 
C”^(n))  the  space  of  real- valued  functions  (respectively  m-times  continuously 
differentiable  functions)  defined  on  equipped  with  the  topology  of  pointwise 
convergence  (respectively  uniform  convergence  on  the  compact  subsets  of  fi) . 
Let  us  recall  some  definitions. 

Definition  1.1.  A  real-valued  function  H  defined  on  fl  x  H  is  a  reproducing 
kernel  on  O  x  O  if 

1)  H  is  symmetric:  H{t,s)  =  -ff(s,t)  for  all  t,  s  E  fi, 

2)  If  is  of  positive  type: 

k,l=N 

^  >  0, 

k,i=i 

for  any  finite  point  set  {tk}k=i  real  numbers 

Definition  1.2.  A  vector  subspace  Ji  oflRp  is  said  to  be  a  hilbertian  subspace 
oflR^  (respectively  if 

1)  7i  is  a  Hilbert  space, 

2)  The  natural  injection  from  H  into  (respectively  €"^(0.) )  is  continuous. 

We  review  some  important  results  on  reproducing  kernels  which  are  studied 
in  [4]. 
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Theorem  1.1. 

1)  A  Hilbert  space  H  (respectively  a  real-valued  function  H  defined  onflxfl) 
is  a  hilbertian  subspace  of  (respectively  a  reproducing  kernel  onflxQ) 
if  and  only  if  there  exists  one  and  only  one  reproducing  kernel  H  on  fix  Q, 
(respectively  hilbertian  subspace  H  of  such  that 

u(t)  =  {u\H{-,t))n,  yteU, 

7i  is  called  the  hilbertian  subspace  associated  with  H . 

2)  For  any  hilbertian  basis  ofH:  H{t,s)  =  fi{t)fi{s). 

i^I 

3)  If  H  is  separately  m-times  continuously  differentiable,  then  H  is  a  hilber¬ 
tian  subspace  of  C^ifl). 

4)  The  vector  space  Ho  =span{[H{-,t))^^^}  is  dense  in  H. 

Let  (V,  (■  I  •)v)  be  a  Hilbert  space,  be  a  set  and  /  be  a  function  from  fl  into 

y. 

Definition  1.3.  For  all  f  :  ft  — ►  V,  we  define  a  (V,  /)-reproducing  kernel  Hj 
=  {f{t)  I  /(s))v,  V(t,s)  eflxfl.  (1.1) 


We  have  the  following  result: 

Theorem  1.2.  Hj  defined  by  (1.1)  is  a  reproducing  kernel  on  fl  x  fl  and  its 
associated  hilbertian  subspace  Hf  of  is 

Hf  =  e  R^  I  G  y  :  w(t)  =  {u  \  f{t))v,  Vi  G 


Proof:  One  can  easily  verify  that  Hj  is  a  reproducing  kernel. 

Let  Hf  :  V  — >  R^  be  defined  by  {Hfu){t)  =  {u  \  f{t))y.  The  mapping  Hj 
is  linear,  and  the  inequality 

I  I  ^  I  ^  k  I  f{f)  \v=\  u  \v  if/(i,i)2, 

for  all  i  G  n  and  for  all  u  G  V  implies  that  it  is  continuous.  Let  M  be  the 
closure  in  V  of  the  vector  space  span{(/(i))^^j^},  and  Pj^  the  orthogonal 
projector  on  M.  We  define  on  Hf  =  Hf{V)  the  bilinear  form 


{Hfu  I  Hfv)nf  =  {Pm'^  I  Pm'^)v‘ 

It  is  easy  to  see  that  this  form  is  a  scalar  product  on  Hf.  Then  the  linear 
mapping  Hf  :  M  — >  Hf  is  an  isometry,  and  consequently  (Hf,  {•  \  is  a 
Hilbert  space.  For  all  f  G  H,  the  function 
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Hf{t,  s)  =  im  I  f{s))v, 

is  an  element  oiHf,  and  satisfies  the  reproducing  formula 

iHfu)it)  =  {Hfu  I  Hf{t,-))nr  Vu  G  K 

Consequently  (see  Theorem  1.1),  is  a  hilbertian  subspace  of  and  admits 
Hf  as  reproducing  kernel.  □ 

Theorem  1.3.  Let  co  be  a  set  and  c  a  mapping  from  oj  to  Q.  Then 


(Ec)f(y,z)  =  Iff(c(y),c(z))  =  {f(c(y))  |  f(c(z))}v 
is  a  reproducing  kernel  on  cj  x  lv. 

Proof:  For  all  y  E  f(c(y})  is  in  V.  The  function  {Hc)j:  is  symmetric  and 
is  of  positive  type: 

k,l=N  k=N  k=N 

^  AjfcAi(i?‘c)/(?/fc,?/f)  =  {^  Afe/(c(2/fc))  I  X]  Aft/(c(2/fc)))v^  >  0.  □ 
k,l=l  k=l  k=l 

Example  1.1.  Let  V  =  L‘^{a,b),  0  a  subset  of  IR  and  f{t){x)  =  exp(ca:t) 
where  c  is  a  real  constant.  Then 

r  (exp(c6(t  +  s))  -  exp{ca(t  +  s)))/{c(t  +  s)),  if  (t  +  s)7^0, 
Hf{t,s)  =  <  ,  . 

lb  —  a,  otherwise. 

Example  1.2.  Let  V  =  L^(1R‘*’),  and  suppose  O  is  a  subset  of  R”. 

For  all  functions  c  :  ft  — ^  (0,  +oo),  we  have 

(i)  If  f{t){x)  =  then  Hf{t,s)  =  ^  ■ 

•y'c(t)  +  c(s) 

,  and  in  particular  if 


c(t)  +  c(s) 


(ii)  If  f{t){x)  =  exp  then  Hf{t,s)  = 

P(t) 

c{t)  =  (with  P{t)  and  Q{t)  polynomials),  we  obtain  the  rational  repro- 
Q{t) 

ducing  kernel 


Qmis) 


P{t)Q{s)-\-Pis)Q{t) 


§2.  (V,f)-Reproducmg  Kernels  of  Convolution  Type 

We  consider  the  case  where 

1) V  =  L2(]R")  and  =  R". 

2)  f{t)(x)  =  f{t  —  x),  with  /  in  the  familar  Sobolev  space  ^Z'"^(R”). 
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Theorem  2.1.  We  have  the  following  properties: 

1)  =  h,(t  -  s)  with  hfd)  =  if  *  fm  =  /-(i  /•/  p)(0,  where 
f(^x)  =  f{—x)  and  Tf  is  the  Fourier  transform  of  f. 

2)  hf  €  CJ>(R")  =  {«  e  C'”(1R'*)  I  I  Urn  (D°u)(t)  =  0,  0  <|  a  |<  m}  . 

3)  The  associated  hilbertian  subspace  of  Hf  is 

Tif  =  f  *  L^(]R”)  CJ"(]R")  {continuous  embedding). 


4)  In  particular,  if\iFf\>Q,  then 


nf  =  {wes'iy^me  lL(k"),  ^  e 


equipped  with  the  scalar  product 

(tei  I  W2}n/  =  / 
Jr^ 


5)  If  f  is  radial,  then  hf  is  radial:  Hf(t,s)  =  hf{\t  —  s\). 

6)  For  all  distinct  points  in  K”,  the  matrix  =  {Hf{tk,ti))^^^ 

is  invertible  (strictly  positive  deSnite). 

Proof: 

1)  We  have 

H,{t,  s)  =  {2ir)-i  f  |  =  J^(|  JT/  |=>)(t  -  s). 

Jr^ 

2)  f  e  ^  D^hf  =  {D^f)  *  /  G  CJ(]R")  for  0  <|  a  |<  m,  (see  [2]). 

3)  is  a  consequence  of  Theorem  1.2  and  the  property  given  in  1). 

4)  Since  Hf  ^  Co”(]R”)  <S  ,  we  have  the  equivalences: 

{w  €  Hf}  {3k  G  :  Tw  =  TuTf} 

«■  {J^W  e  iL(lR").  ^  e  L^(lR")}- 

From  Theorem  1.2,  we  have 

u;(t)  =  {u  I  f{t  -  •))l2{r-)  =  {Pm'^  I  /(*  -  •))l=(r-)  =  (Pmu  *  /)(0* 
Then  Tw  =  PPm  '^P f  i 

(wi\w2)Hf-j  PM'^\{^)PM'^2{x)dx 


(V,f)-Reproducing  Kernels 
5)  For  any  orthogonal  matrix  A, 
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hf{At)  =  (  f{x)f{At-x)dx=  (  f{Ax)f{A{i~x))dx 
Vr"  JR" 

=  /  f(x)f{t-x)dx  =  hf{t), 
jR" 


since  f{Ax)  =  f{x)  and  |  detA  |=  1. 

6)  We  suppose  that  /  ^  0  in  L^(]R").  Since  the  matrix 


Hn  =  {  I  f{tk  —  x)f{ti  —  x)dx)  is  a  Gram  matrix,  it  is  invertible 

^  1<A:,Z<JV 

if  and  only  if  the  system  {/(t*  —  •)}£Li  lineary  independent  in  L^(]R”).  If 
k=N 

Ckf(tk  -  x)  =  0  in  L^(1R”),  for  G  IR,  1  <  A;  <  iV,  then  by  the  Fourier 


k=l 


k=N 


transform  we  get  (  =  0  in  L^(]R"). 


jk=l 


k=N 

The  Lebesgue  measure  of  the  set  jV"  =  {^  G  IR”  |  =  0}  is  equal 

k=l 

to  zero.  Then  vanishes  outside  A/”,  i.e:  !Ff  =  0  in  L^(1R”)  and  by  the 
inverse  Fourier  transform,  /  =  0  in  I/^(]R”),  which  complete  the  proof.  □ 


Example  2.1.  Let  u{x)  =  (1—  |  x  |)+  and  Tu  —  v.  We  have  v{x)  = 

(i)  Taking  f  =  T{\u\^),  Hfit,  s)  =  {l-\t-s  |)+. 

(ii)  Taking  f  =  T{\v\^),  Hf{t,s)  =  • 


Example  2.2.  (  Bessel  reproducing  kernels)  For  n  G  IN,  o  G  IR  and  a  >  n; 
consider  Ga  €  L^(]R”)  defined  by  T(Ga){x)  =  (1  +  |  x 

(i)  Taking  /  =  T(\  Ga  1^),  =  (1+  1 1  -  s  |2)-f . 

(ii)  Taking  /  =  ^(|  TG„+i  |^),  Hf(t,  s)  =  exp(-|  t  -  s  |)  and 

W/  =  (Sobolev  space). 


Example  2.3.  (i/-B-spline  reproducing  kernels)  Let 

1)  y;(x)  =  j^x\. 

2)  V  £  S'  (distributions  with  compact  support)  such  that  v{p)  =  0,  for  all 
polynomial  p  in  'Pi(]R). 

3)  f  =  u*Yi. 

For  such  functions  /,  we  give  the  following  theorem  without  proof. 
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Theorem  2.2.  We  have: 

1)  fe  L^(R). 

2)  For  all  u  in  =  {v  e  Lf^cO^)/  ^  (Beppo-Levi 

space)  we  have  /  -  x)dx  =  {u  *  u){t). 

Jn 

3)  Hf{t,  s)  -  (-l)^(i>  *  *  >2J+i)(i  -  s)  and  Uj  =  v  *  \/'+^(]R). 

In  the  particular  case  of  divided  differences,  v  is  defined  as  the  mth-iterated 
convolution  z/  =  and  i>  =  ' 


§3.  Data  Fitting  by  (V,f)-Reproducing  Kernels 


Let 

Hf  into  TRr 


a  set  of  distinct  points  in  f2,  and  define  a  linear  operator  An  from 
by  An{u)  = 


Definition  3.1.  For  all  zn  £  and  (  6  |0, 1)  we  define  a  spline  to  be  any 
solution  of  the  following  minimal  approximation  problem: 


{p.M)  ■■ 


inf  ^(1  -  (){u  I  u)n,  +  -  z/v||rk), 


where 


Aj^^{zn},  if  e  =  0  (Interpolation), 
Hf,  if e  €]0, 1[  (Smoothing). 


The  following  theorem  gives  the  spline  in  the  case  e  /  0. 

Theorem  3.1.  For  all  (c^zn)  eJO,  l[xlR^,  the  problem  Pe(zN)  (Smoothing) 
admits  a  unique  solution 


k=N 


fc=l 


where  the  coefficients  =  (Af, . . . ,  A^)  G  IR^  are  the  solution  of  the  system 
(Hn  +  ~  ^N, 

with  Hn  =  (Hf(tk,ti))i<k,i<N  and  In  is  the  identity  matrix. 

Proof:  1)  Prom  the  continuous  embedding:  Hf  IR^  (see  Theorem  1.2), 
we  deduce  that  An  is  continuous.  2)  AN(Hf)  is  closed  as  a  vector  subspace 
of  ]R^.  Then  from  the  general  spline  theory  (see  [1,3])  we  get  the  theorem.  □ 
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Theorem  3.2.  The  following  two  properties  are  equivalent: 

1)  For  all  zn  G  the  problem  Po{zn)  admits  a  unique  solution. 

2)  The  system  is  linearly  independent  in  V. 

Proof:  For  all  G  the  problem  Po{zn)  admits  a  unique  solution  if  and 
only  if  the  matrix  Hn  =  invertible  (see  [1,3]).  Since  the 

matrix  Hn  =  |  /(^0)v^)i<fe is  a  Gram  matrix,  it  is  invertible  if 

and  only  if  the  system  {/(tfc)}fcLi  is  lineary  independent  in  V.  □ 

Furthermore,  for  the  particular  case  V  =  L^(1EI”)  and  /  G 
Theorem  3.2  and  the  property  (6)  of  Theorem  2.1  imply  the  following  theorem. 

Theorem  3.3.  For  all  f  G  and  zn  G  the  problem  Po(^jv) 

(Interpolation)  admits  a  unique  solution 

k=N 

k=l 

where  the  coefficients  =  (Aj, . . . ,  A^)  G  IR^  are  the  solution  of  the  system 

Hn^%  =  zpj, 

with  Hn  =  {Hf{tk,ti))i^k,i<N  and  In  is  the  identity  matrix. 

§4.  Data  Fitting  Preserving  Polynomials 

Let  the  vector  space  of  polynomials  of  degree  at  most  d.  We  suppose: 

(HI)  For  all  p  G  Pd(lR”)  the  subset  {tk}k=i  of  iR”  is  such  that 
{p(^A:)  =  0,  1  <  <  A^}  p  =  0. 

(H2)  = 

We  remark  that  in  the  case  V  =  L^(IR")  and  /  G  //"""(IR”),  the  hypothesis 
(H2)  is  satisfied  because  Hf  C  C^(1R”)  (see  Theorem  2.1(2)),  and 

C?^(]R”)nPd(]R")  =  {0}. 

Let  Hf  be  the  Hilbert  direct  sum:  Hj  =Hf  0Pd(lR”).  We  denote  by  11/ 
the  orthogonal  projector  from  Hj  onto  Hf,  and  we  define  on  Hj  the  linear 
mapping  An{u)  =  {u{tk)) €  IR^.  For  all  {€,zn)  G  [0,1]  x  K^,  we 
consider  the  following  minimal  approximation  problem  in  Hf : 

(Pe(ziv))  :  ((1  -  e){Il,{u)  \  Ilf(u))n,  +  4Anu  - 

where 

if  e  =  0  (Interpolation), 

Ce  =  *  Hf,  if  e  g]0,  1[  (Smoothing), 

^  'Pd(lR’^),  if  e  =  1  (Least  squares). 

The  hypothesis  (HI)  implies  that  the  problem  Pi{zn)  admits  a  unique  solu¬ 
tion.  In  the  case  e  g]0,  1[  (Smoothing)  we  have  the  following  theorem: 
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Theorem  4.1.  For  all  (e,  e]0,  l[xR^,  the  problem  Pe(zjv)  admits  a 

unique  solution  In  the  case  e  €]0, 1[,  the  solution  is  given  by 


k=N 


k=l 


i=l 


where  the  coefficients ‘A‘  =  (Aj, . . . ,  Xj^)  €  IR^  and*B‘  =  (6j, . . . ,  b‘^^)  6  E,”'* 
are  the  solution  of  the  system 


{Hn  +  ‘-^In 

V  ‘E 


with 

1)  Hn  =  (f^f{tk,ti))^<k,l<N  identity  matrix, 

2)  E=  (Pk,i)l^i^na  with  Ek,i  =  Pi{tk)  and  (pi)i<i<nd  Is  a  basis  ofPd(Il”). 

In  particular,  if  there  exists  p  6  Vcl(]R^)  such  that  {p{tk)  =  ZN,ki  1  ^  ^  ^ 
then  i=p  (preserving  polynomials  property). 

Proof:  Theorem  4.1  is  a  consequence  of  general  spline  theory  (see  [1,3]); 

1)  An  is  continuous  since  Hj  is  a  hilbertian  subspace  of  IR^. 

2)  II/  is  continuous  and  11/ (H/)  =  Hf  is  closed  since  11/  is  an  orthogonal 
projector. 

3)  kerA^v  H  kerll/  =  {0}:  derives  from  the  hypothesis  (HI)  and  the  fact 
that  kerll/  =  Fd(lR’^). 

4)  ker^iv  +  kerH/  is  closed  since  kerH/  is  a  finite  dimensional  vector  space. 

n 
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Adaptive  Wavelet  Galerkin  Methods 
on  Distorted  Domains: 

Setup  of  the  Algebraic  System 


Stefano  Berrone  and  Karsten  Urban 


Abstract.  We  use  the  algorithm  of  Bertoluzza,  Canuto  and  Urban  [2] 
for  computing  integrals  of  products  (of  derivatives)  of  wavelets  in  order  to 
solve  elliptic  PDEs  on  2D  distorted  domains.  We  construct  a  variant  of  the 
original  method  which  turns  out  to  be  more  efficient.  Several  numerical 
results  are  presented. 


§1.  Introduction 

Adaptive  wavelet  Galerkin  schemes  have  quite  recently  been  proven  to  offer 
great  potential  for  numerically  solving  boundary  value  problems  for  partial 
differential  equations.  On  the  one  hand,  strong  analytical  properties  such 
as  convergence  and  optimal  efficiency  have  been  proven  for  elliptic  operators 
[6,9].  On  the  other  hand,  first  numerical  tests  also  on  non-tensor  product 
domains  indicate  the  applicability  of  such  methods,  [1]. 

However,  the  major  obstacle  so  far  is  the  efficient  computation  of  the 
entries  of  the  stiffness  matrix  and  the  right-hand  side  of  the  corresponding 
algebraic  systems.  In  fact,  it  turns  out  that  these  entries  are  more  expensive  to 
compute  than,  e.g.,  in  the  case  of  adaptive  Finite  Element  Methods.  In  [2],  a 
method  to  adaptively  approximate  and  compute  these  entries  was  introduced 
and  analyzed;  numerical  results  were  given  for  a  ID  example.  In  this  paper, 
we  study  the  application  of  the  algorithm  in  [2]  for  2D  ‘distorted’  domains, 
which  are  parametric  images  of  the  unit  square.  This  allows  the  study  of  the 
influence  of  ‘realistic’  parametrizations  of  non-tensor  product  domains  on  the 
assembling  of  the  algebraic  system.  We  incorporate  some  improvements  over 
the  original  method  in  [2]  to  increase  efficiency,  and  present  various  numerical 
results. 
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Ji  U1 

/  bo,,p{x)D°^u{x)D^v{x)dx 

Jn 


or 


(1) 


§2.  Adaptive  Approximation  of  the  Algebraic  System 

Given  a  linear  boundary  value  problem  in  a  bounded  Lipschitz  domain  Q  C 
H”  (n  >  1),  its  numerical  approximation  by  a  variational  method  (Galerkin, 
Petrov-Galerkin,  weighted  residuals,  ...)  requires  the  computation  of  integrals 
of  the  form 

/  fp{x)D'^v{x)dx, 

Jn 

where  a,/?  G  W”  are  suitable  multi-indices, 

- u  and  v  are  suitable  trial  and  test  functions  belonging  to 

and  respectively,  ba,p  G  L°°{n)  and  fp  G 

We  consider  a  wavelet  Galerkin  method  with  trial  and  test  spaces  Sa  = 
span  generated  by  adaptively  choosing  a  finite  subset  =  {i/^a  :  A  G  A} 
within  a  wavelet  basis  ^  =  {'ipx  :  A  G  JT}  in  i.e.,  K  C.  J  (see,  e.g., 

[5,10]).  The  wavelets  are  assumed  to  have  the  appropriate  regularity  for  the 
above  integrals  to  be  well  defined. 

The  construction  of  such  wavelet  bases  on  fairly  general  domains  ft  is  not  a 
trivial  task.  However,  quite  recently  significant  progress  has  been  made  on  this 
topic,  see  [3,4,8,12]  and  also  [13]  for  a  somewhat  different  approach.  The  main 
idea  behind  all  the  constructions  in  the  first  cited  papers  is  domain  decom¬ 
position  and  matching.  The  domain  ft  is  subdivided  into  N  non-overlapping 
subdomains  fti.  Each  subdomain  is  mapped  to  the  n-dimensional  reference 
cube  Cl  :=  [0, 1]""  by  means  of  smooth  parametric  mappings 

ni  =  Fi{h),  Gi  -F-\  (2) 

Then,  each  A  G  J",  restricted  to  Ui  is  the  image  through  Fi  of  a  linear 
combination  of  tensor  product  wavelets  'ipx  on  Cl,  i.e.,  if  A  =  (i,^)  [j  |A| 
denoting  the  level  and  k  the  location  in  space  as  well  as  the  type  of  wavelet), 
then 

i’x{x)\ni=  X!  ^A'(^»(^))’ 

A'65{i,A) 

where  S{i,X)  is  a  suitable  set  of  indices  of  the  form  A'  =  {j,k')  with  k'  €  Cl 
and  7^,  .  are  suitable  coefficients  independent  of  j  (see  e.g.  [3,4]). 

2.1.  Reduction  to  univariate  integrals 

We  will  only  consider  the  calculation  of  the  integral  on  the  left-hand  side  of 
(1)  which  enters  into  the  stiffness  matrix.  The  entries  for  the  right-hand  side 
are  treated  analogously,  [2].  Hence,  replacing  u  by  ‘ipx  and  v  by  on  the 
left-hand  side  in  (1)  for  some  \,fjL  e  A,  we  get 

N 


f 

ax,^L■=  /  6a,/3(a^)^"V’A(a^)£>^'0MW^^  =  X] 

'=1  a'eT(i,a).  X'eS{i,X), 
0'eT[i,0) 

Jn 


Adaptive  Wavelet  Galerkin  Methods 


67 


where  the  sets  T{i,a),T{i,j3)  are  defined  by  the  chain  rule,  and  da>  and  dp> 
are  smooth  functions  depending  on  Gi  and  its  derivatives.  The  integrals  which 
appear  on  the  right-hand  side  of  (3)  take  the  form 

f  ......  .  ”  . 

•=  /.  where  =  H  W 

i=i 

and  i9^,  are  univariate  scaling  functions  or  wavelets  on  [0,1].  Now,  we  use 
the  Two-Scale-Relation  for  the  wavelets  to  express  them  in  terms  of  scaling 
functions  on  the  next  higher  level,  i.e., 

^a(*)  =  ™A,An(^)-  (5) 

where  are  the  refinement  coefficients.  Here,  the  index  set  C  is 

determined  by  the  Two-Scale-Relation  and  Ij  denotes  the  set  of  all  scaling 
function  indices  on  a  level  j.  Hence,  d^  ^  becomes 

'^A,m=IZ  ™A,A’"/i.A /.  (6) 

A€Aj  AEAs 

The  computation  of  each  integral  on  the  right-hand  side  of  (6)  would 
be  highly  efficient  if  we  could  reduce  it  to  a  product  of  univariate  integrals, 
but,  in  general,  the  function  c  is  not  a  tensor  product  of  univariate  functions. 
However,  we  can  expand  it  in  an  appropriate  tensor  product  wavelet  basis 

0-  :=  [ei  ;  r,(x)  =  n  0  e  S'],  (7) 

i=:l 

(where  iO’^.  are  again  univariate  scaling  functions  and  wavelets  on  [0, 1],  re¬ 
spectively,  possibly  different  from  as  follows 

(8) 

Then,  we  approximate  c  locally  on  ^  :=  supp  D  supp  by  a  finite 

sum  (Ja*  c,  obtained  by  restricting  the  sum  in  (8)  to  a  finite  index  set  A*  C  J* 
(depending  on  c  as  well  as  on  d,  /3,  A,  /i,  A  and  /i),  whose  precise  definition 
will  be  given  below.  Correspondingly,  d^  ^  is  approximated  by 

H  ™a,a™A,A  /  (9) 

ASAs  AeAii  70 
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which  is  a  finite  linear  combination  of  products  of  univariate  integrals  of  the 
following  form 


i  =  1, . . .  ,n. 


(10) 


An  algorithm  for  computing  such  integrals  can  be  found  in  [2].  However,  here 
we  use  biorthogonal  B-spline  wavelets  [7,11]  as  trial  and  test  functions.  Due 
to  their  explicit  representation,  efficient  direct  formulas  for  the  integrals  (10) 
are  available  and  have  been  used  for  the  subsequent  numerical  experiments. 

Let  us  mention  that  the  above  strategy  slightly  differs  from  [2]  since 
here  we  approximate  c  locally  on  whereas  in  [2]  this  is  done  on  the 

somewhat  larger  domain  :=  supp-^j^n  supp-^^.  This  new  method  ensures 
automatically  that  only  non-zero  integrals  are  computed,  avoiding  a  wide 
number  of  checks,  which  explains  why  the  present  method  is  more  efficient 
than  the  original  one. 


2.2.  Adaptive  approximation  of  the  stiffness  matrix 

Now,  we  are  going  to  describe  the  construction  of  the  index  set  A*  introduced 
above.  To  this  end,  we  have  to  introduce  some  notation.  Let  us  set 

T(A,A)  :=  •  |supp  n  supp  n  supp  >  0},  (11) 


and  j  :=  min{|A|,  |A|}  as  well  as  J  :=  max{|A|,  |A|}-  Let  Ro  be  the  number 
of  zero  moments  of  [5,10].  Moreover,  let  and  be  the  largest  integers 
such  that  e  W^^^’°°(n)  and  §1  e  respectively.  Then,  we  set 

R  :=  mm{Ri>,Tx  -  ||d||,T^  -  ||^||}. 


We  make  the  following 

Assumption  1.  The  system  Q*  defined  in  (7)  allows  the  characterization  of 
the  Besov  space  for  indices  (a,  q)  in  a  certain  range  Sq*  C  iR"*"  x  (0, 1], 

i.e.,  the  Besov  seminorm  |  •  has  the  representation 

/  \ 

~  (  Z  j  .  V  e  (12) 

The  following  notation  will  be  frequently  used  in  the  sequel.  For  t  = 
1, . . . ,  L,  we  consider  (possibly  different)  wavelet  bases  '  A^  G  tJ}^ 

Then,  for  A^  G  £  J',  ^  =  1, . . . ,  L,  we  define 


i(Ai,. ..  ,Al) 


1,  if  inLiSupp  >  0, 

0,  otherwise. 
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Now,  we  are  in  a  position  to  define  the  set  A*.  Let  us  fix,  once  and  for  all, 
independently  of  A  and  £,  a  non-increasing  £^(jCVo)-sequence  6  = 
with  strictly  positive  elements,  whose  ^^(IVo)-norm  is  close  to  1.  Let  c  G 
some  (<7,  g)  G  Sq*.  For  an  index  i>  ^  define  its  relevance 

for  the  computation  of  co  dx  as 

pff\v)  :=i(A, 

2-iR+nl2)(\u\-l)  2"'^/2  ol^KII'^ll-”) 

iiAi-iAir 

Finally,  for  any  e  >  0,  we  define 

A*  :=  {i>  e  I(A,  A)  :  >  s/(”»A,iTOA,A#^A#^/i)  or  |i>|  <  J},  (13) 

which  concludes  the  construction  of  an  adaptive  approximation  d’^  ^  of  dx,fi- 

Remark  2.  The  construction  of  A*  according  to  (13)  seems  to  require  the 
explicit  knowledge  of  all  the  (infinite)  coefhcients  cp  of  c.  However,  one  can 

estimate  a  priori  a  level  such  that  x  #A^),  if 

\v\  >  Je.  Following  [2]  it  is  easy  to  show  that  this  is  valid  for 


Je- 


_ _ 

R  +  cr  +  n(l-i) 


(14) 


where  we  have  set 

Qe  :=  I  log2  ^  Aj^  ^A^)[  +  (i?  +  ri)J  +  [Ai(|A|  ~  s) 

+  lAKI^I  -  «)  +  Const. 

Replacing  computation  of  ax^y,  in  (3)  by  dx^y  results  in  an 

adaptive  approximation  of  ax,y.  As  already  mentioned,  one  can  construct 
an  adaptive  approximation  /j(  for  the  entry  of  the  right-hand  side  fx  '-= 
in  the  same  way. 

2.3.  Error  estimates 

Let  us  assume  that  the  boundary  value  problem  we  aim  at  approximating 
is  elliptic  of  order  25.  Let  H§{d)  be  the  closed  subspace  of  which 

accounts  for  the  given  boundary  conditions.  The  wavelet  basis  ^  intro¬ 
duced  above  is  assumed  to  form  a  Riesz  basis  of  this  space.  The  wavelet 
Galerkin  approximation  of  our  problem  is  obtained  by  replacing  by 

Sa  ■=  spanl-^A  *  A  G  A},  where  again  A  is  an  adaptively  chosen  subset  of  JT. 
The  corresponding  Galerkin  solution  will  be  denoted  by  ua 
The  vector  ua  :=  (ua)agA  is  obtained  by  solving  the  linear  algebraic  system 
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Fig.  1.  Physical  domain  (left)  and  exact  solution  (right)  for  the  numerical  tests. 

AaUa  =  fA,  which  is  defined  in  a  straightforward  manner.  Let  the  integrals 
which  appear  in  the  stiffness  matrix  as  well  as  the  right-hand  side  be  com¬ 
puted  in  an  approximate  way,  as  described  above.  Denote  the  resulting  matrix 
by  A\  and  the  resulting  vector  by  let  be  the  solution  of  the  modified 
linear  system  and  let  u\  = 

The  following  estimate  on  the  effect  of  the  described  approximation  of 
the  stiffness  matrix  and  the  right-hand  side  has  been  established  in  [2].  It  is 
readily  seen  that  it  also  holds  for  our  variant  of  the  original  method  in  [2]. 

Theorem  3.  Under  the  above  and  similar  assumptions  for  computing  the 
right-hand  side,  there  exists  Co  ^  0  such  that  for  ail  0  <  £  <  £o- 


<  □  (15) 

kAkfi  ~ 


§3.  Numerical  Results 

In  this  section,  we  present  our  numerical  results.  We  consider  the  Poisson 
problem  with  homogeneous  Dirichlet  boundary  conditions  on  a  domain  D 
which  is  the  parametric  image  of  Cl  under  a  suitable  transformation.  The 
domain  is  displayed  in  Figure  1,  left.  The  boundary  of  fl  consists  of  two 
straight  lines  and  two  curved  parts.  We  computed  the  parametrization  of  the 
four  parts  of  the  boundary  and  then  the  parametric  mapping  F  :  D  — »  H  is 
determined  by  transfinite  interpolation,  [14]. 

We  constructed  a  solution  u  which  satisfies  the  boundary  conditions  and 
which  has  a  strong  layer  near  the  upper  right  corner  of  the  domain.  This 
function  is  shown  on  the  right  in  Figure  1,  Since  we  have  an  explicit  formula 

for  we  determined  the  right-hand  side  /  by  using  MAPLE  V. 

The  choice  of  these  parameters  allows  us  to  test  an  interesting  situation. 
Indeed,  the  parametric  mapping  is  obviously  far  from  being  a  tensor  product. 
Hence,  we  can  study  the  influence  of  a  ‘realistic’  transformation.  Even  though 
this  influence  was  studied  in  [2]  in  ID,  we  face  here  a  non-tensor  product 
situation  for  the  first  time.  Moreover,  for  computing  the  right-hand  side,  two 
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o 

ii 

II 

UK 

=  Jo, 

l/^l=Jo  +  l 

=  M  =  Jo  + 1 

e 

4 

Jmax 

#Int 

Je 

Jmax 

#Int 

Jmax 

#Int 

0.5 

5 

3,4 

16,80 

8 

5,6 

108, 1264 

6 

4,5 

196,8150 

0.25 

5 

4 

18, 138 

8 

6 

142, 1603 

7 

5 

224, 13168 

0.125 

5 

4,5 

26, 196 

8 

6,7 

184, 2751 

5,6 

344, 18052 

0.0625 

7 

4,5 

35, 260 

10 

7 

281, 3924 

8 

6 

442, 21924 

0.03125 

8 

5,6 

44,454 

11 

7,8 

396,8048 

9 

6,7 

556,36600 

0.015625 

8 

6 

61,656 

11 

8 

716, 12044 

10 

7 

840,49916 

0.0078125 

9 

6,7 

82, 1318 

12 

8,9 

1152,27708 

11 

7,8 

1172, 100464 

Tab.  1.  Estimated  and  determined  maximum  level  and  number  of  integrals  in  A*. 


kinds  of  effects  are  present,  namely  the  parametric  mapping  and  the  layer  near 
the  corner.  We  stress  that  we  do  not  intend  to  study  any  particular  choice 
for  the  adaptive  discretization,  i.e.,  the  choice  of  the  set  A.  We  are  primarily 
interested  in  the  behaviour  of  the  adaptive  approximation  Qj^*c  in  a  realistic 
situation. 

As  trial  and  test  functions  we  used  the  biorthogonal  B— spline  wavelets 
on  the  interval  corresponding  to  the  parameters  d  =  d  =  2  (i.e.,  piecewise 
linear  primal  functions  and  dual  functions  of  lowest  possible  order)  from  [11] 
(see  also  [7]  for  the  original  construction  on  JR).  For  the  system  0*,  we 
choose  as  in  [2]  piecewise  linear  interpolatory  wavelets.  This  of  course  implies 
that  the  computation  of  the  corresponding  wavelet  coeflScients  can  easily 
be  performed.  Moreover,  since  piecewise  linear  interpolatory  wavelets  are 
nothing  else  than  hierarchical  B-splines,  the  integrals  in  (10)  actually  only 
contain  scaling  functions. 

We  used  the  parameters  a  =  2,  q  =  3/4,  4  :=  {k  +  1)“2  as  well  as  the 
corresponding  parameters  r  =  2,  p  =  3/4  and  4  :=  Aj-i  for  the  right-hand 
side,  [2]. 

In  the  ID  tests  in  [2],  the  parameter  s  was  chosen  as  the  error  in  the 
if^-norm  of  a  corresponding  uniform  discretization.  Prom  a  practical  point 
of  view,  this  is  of  course  unrealistic.  First  of  all,  the  solution  is  in  general 
not  known.  Moreover,  the  ultimate  goal  of  an  adaptive  scheme  is  to  avoid 
a  (high  level)  uniform  discretization  but  to  use  the  degrees  of  freedom  in  a 
more  economical  way.  Hence,  we  performed  various  tests  on  the  choice  of  the 
parameter  e. 

Our  first  test  concerns  in  Remark  2  and  the  number  of  integrals  needed 
for  computing  the  elements  of  the  stiffness  matrix.  Our  computations  are 
performed  in  this  way:  at  first,  we  start  from  the  minimum  level  (jo  =  3)  for 
the  used  wavelet  basis,  where  we  fix  a  certain  e,  then  we  solve  the  problem 
with  scaling  functions  and  wavelets.  In  Table  1  we  compare  the  theoretical 
estimate  Jg  on  the  maximum  level  in  A*  with  the  values  that  were  actually 
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determined  by  our  indicators  in  (13).  For  different  portions  of  the  stiffness 
matrix,  we  display  the  predicted  the  detected  maximum  levels  by  the 
indicator  as  well  as  the  minimum  and  maximum  number  of  integrals  needed 
for  computing  non  zero-entries. 

We  see  that  the  estimated  Je  is  always  larger  than  the  effectively  used 
maximum  level.  This  is  what  we  expected,  but  the  efficiency  of  the  algorithm 
may  be  reduced  by  an  excessive  over-estimate  of  .  We  also  deduce  that  the 
efficiency  of  the  method  crucially  depends  on  the  choice  of  e  since  the  number 
of  integrals  strongly  grows  for  decreasing  e.  This  is  surely  due  to  the  low  order 
of  the  interpolatory  wavelets  used. 

Next,  we  present  in  Table  2  the  average  number  of  integrals  computed 
for  the  stiffness  matrix  and  the  right-hand  side  for  the  first  two  levels  with 
the  same  e  of  Table  1.  Here  J^  :=  max{|A[  :  A  €  A}. 


£ 

0.5 

0.25 

0.125 

0.0625 

3.1e-2 

1.6e  -  2 

7.8e  -  3 

i 

KE3 

16.99 

24.15 

41.62 

ESQ 

EEEE9 

234.08 

305.98 

489.47 

738.67 

1336.59 

/ 

sifJBl;! 

55.69 

95.10 

Ja  =  4 

87.4 

87.4 

mem 

207.72 

411.61 

Tab.  2.  Average  number  of  integrals  per  entry. 


We  deduce  that  the  choice  of  e  not  only  influences  the  maximal  and 
minimal  number  of  integrals  as  shown  in  Table  1.  Since  the  average  number 
of  integrals  grows  when  e  decreases,  the  choice  of  e  effects  the  efficiency  of  the 
computation  of  the  whole  stiffness  matrix.  Moreover,  the  presence  of  the  first 
wavelet  level  also  increases  the  number  of  integrals. 

Finally,  we  consider  the  error  in  the  H^-norm  and  the  relative  error 


^ 

for  different  choices  of  the  parameter  e.  In  Table  3  and  Figure  2  ‘rate’  cor¬ 
responds  to  the  rate  of  convergence  w.r.t.  the  exact  solution  in  the  i7^-norm 
for  the  first  two  levels  in  A.  We  see  that  these  quantities  do  not  depend  on  e. 
At  these  levels  the  relative  discretization  error  still  exceeds  the  relative  error 
(15).  This  explains  why  the  rate  of  convergence  is  basically  constant  w.r.t. 
the  choices  of  e. 

Hence,  the  choice  of  £  matters  only  if  this  value  is  at  least  of  the  same 
order  than  the  relative  discretization  error.  We  remark  that  also  for  increasing 
£  all  scaling  functions  on  level  J  whose  support  overlap  belong  to  A  . 
This  implies  that  the  error  due  to  the  approximation  of  the  entries  of  the 
linear  system  is  bounded. 
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£ 

^■1 

1.9666 

0.1335 

0.25 

PiRgg 

IKklifa 

0.0625 

1.9658 

0.1338 

0.03125 

1.9563 

0.1343 

0.015625 

0.1342 

0.0078125 

■KMita 

0.1343 

Tab.  3.  Relative  error  and  rate  of  convergence  in  dependence  of  e. 


Fig.  2.  Rate  (left)  and  (right)  of  Table  3. 
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Scattered  Data  Near-Interpolation  with 
Application  to  Discontinuous  Surfaces 


Renata  Besenghi  and  Giampietro  Allasia 


Abstract.  This  paper  discusses  a  particular  type  of  function  approxima¬ 
tion  on  scattered  data  in  a  general  number  of  variables,  and  its  application 
to  surface  representation  with  imposed  conditions.  If  the  given  function 
values  are  subject  to  errors,  it  is  not  appropriate  to  interpolate  the  function 
at  the  data  in  the  sense  of  exact  matching.  As  a  consequence,  we  formulate 
a  weakened  version  of  the  classical  scattered  data  interpolation  problem, 
and  give  a  siniple  and  efficient  procedure  to  obtain  near-interpolation  for¬ 
mulas.  Near-interpolants  enjoy  many  remarkable  properties,  which  are 
very  useful  frorn  both  theoretical  and  practical  points  of  view  (shape  pre¬ 
serving  properties,  operator  positivity,  subdivision  techniques,  parallel  and 
multistage  computation).  Applications  of  near-interpolants  to  the  rep¬ 
resentation  of  surfaces,  in  particular  with  faults,  are  discussed  in  detail 
(parameter  values,  localizing  weights,  etc.). 


§1.  Introduction 

In  many  applications,  the  given  function  values  are  subject  to  errors;  hence 
it  is  not  appropriate  to  interpolate  the  function  at  the  data  in  the  sense  of 
exact  matching,  but  it  seems  more  appropriate  to  approximate  the  function 
or,  more  precisely,  to  get  a  relaxed  interpolation  or  near-interpolation.  Data 
requiring  near-interpolation  by  scattered  data  methods  occur  in  virtually  ev¬ 
ery  field  of  science  and  engineering.  Sources  include  both  experimental  results 
(experiments  in  chemistry,  physics,  engineering)  and  measured  values  of  phys¬ 
ical  quantities  (meteorology,  oceanography,  optics,  geodetics,  mining,  geology, 
geography,  cartography),  as  well  as  computational  values  (e.g.,  output  from 
finite  element  solutions  of  partial  differential  equations). 

As  a  consequence  of  this  remark,  we  formulate  a  relaxed  version  of  the 
classical  multivariate  interpolation  problem  at  scattered  data  points. 
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Definition  1.  Given  a  set  of  points  Sn  =  {xj,  i  =  distinct  and 

generally  scattered,  in  a  domain  D  C  (s  >  1),  with  associated  values 
{fi,  i  =  1, . . .  ,n},  and  a  linear  space  $(D)  spanned  by  continuous  real  basis 
functions  gj(x’,r)  with  x  e  l>,  r  >  0,  and  j  =  the  multivariate 

near-interpolation  problem  at  scattered  data  consists  in  finding  a  function 
F(x;r)  e  ^(D)  such  that 

n 

F(^Xi  57*)  =  ^  ^  ^j9j  (^i  j  ~  /i  ^  (1) 

i=i 

and 

limej(r)  =  0.  (2) 

r— >0 

We  observe  that  r  works  as  a  parameter,  and  the  limit  case  of  F(x;  r)  when  r 
vanishes 

F{x)  =  F{x;  0)  =  lini  F{x',  r) 

is  an  interpolation  operator.  If  F{x’r)  is  specified,  then  the  ei(r)  in  (1)  are 
known;  these  near- interpolation  errors  at  the  nodes  must  not  be  confused 
with  the  unknown  errors  which  affect  the  corresponding  function  values  fi. 
However,  it  is  reasonable  to  get  things  so  that  the  €i(r)  and  the  errors  on  fi 
are  quantities  of  the  same  order. 

In  Section  2  we  give  a  constructive  procedure  to  obtain  a  wide  class  of 
near-interpolation  formulas.  These  enjoy  many  interesting  properties  which 
are  listed  in  Section  3.  A  crucial  point  in  near-interpolation  is  the  proper 
choice  of  the  parameter  r  in  (1)  and,  eventually,  of  other  parameters;  the 
matter  is  discussed  in  Section  4.  Finally,  Section  5  is  devoted  to  the  application 
of  near-interpolation  to  modelling  faults. 

§2.  Construction  of  Near-Interpolants 

To  solve  the  classical  interpolation  problem,  one  can  consider  basis  functions 
which  depend  on  the  nodes  and,  moreover,  are  cardinal.  The  method  of  car¬ 
dinal  basis  functions  involves  selecting  continuous  cardinal  functions  gj  :  D 
H,  (j  =  l,...,n),  such  that  gj(xi)  =  Sij  ,  (i  =  l,...,n),  where  Sij  is  the 
Kronecker  delta  operator,  and  setting  up  the  interpolation  operator  F  in  the 
form 

n 

■p'(^) = fi  Sii^y 

j=i 

The  corresponding  near-interpolation  problem  considers  basis  functions 
gj{x\T),  which  are  no  longer  cardinal,  but  gj[x\r)  — >  gj(x)  for  r  0.  If  such 
gj(x\r)  are  given,  then 

F(x-,  r)  =  ^  fj  9j(x-,  r)  =  F{x)  +  fi  (2) 

J=1  j=l 

represents  a  solution  of  the  near-interpolation  problem.  In  this  relation  the 
terms  €i{r)  ~  F{xi]  r)  -  /*,  (z  =  1, . . . ,  n),  are  uniquely  determined  and  satisfy 
(2). 
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As  an  example,  let  us  consider  the  near-interpolant  Shepard’s  formula  in 
the  product  form 


3=1 


(4) 


where  d{x^  y)  is  the  Euclidean  distance  between  x  and  y,  and  /3  >  0,  or  in  the 
equivalent  barycentric  form 


n 

p{x\r)  =  Ylfj 
j=i 


[d‘^{x,Xj)+r]  ^ 


=  /i, 


i  =  1,  .  .  .  ,  71.  (5) 


This  formula  no  longer  interpolates  for  r  >  0,  but  p,{x]  r)  — >  fjL{x)  as  r  ^  0, 
where  p{x)  is  the  well-known  Shepard’s  formula  [8,  1]. 

Examining  the  structure  and  the  basic  idea  of  Shepard’s  operator  suggests 
a  simple  and  efficient  procedure  to  obtain  an  interpolation  formula  [1].  The 
corresponding  way  to  obtain  a  near-interpolation  formula  is  contained  in 

Definition  2.  Let  a{x,y;r),  with  x,y  E  D  and  r  >  0,  be  a  continuous 
positive  real  function  such  that 


lima(x,y;r)  =  a{x,y), 


(6) 


where  C((x,y)  >  0,  if  x  ^  y  and  a{x^y)  =  0,  if  x  =  y  for  all  x,y  E  D.  DeSne 
now  the  functions  gj(x;r)  by  the  equations 


gj(x;r)  = 


(7) 


and  the  near-interpolant  F{x;r)  by 


j=l  j^i  2^h=l  ih=l, k^h^\^’^k,r) 


or  equivalently  by 


=  fi,  i  =  l,...,n.  (9) 


Many  choices  are  possible  for  the  function  a(x,y;r)  in  (6);  there  are  no 
constraints  engendered  by  the  set  Sn,  that  is,  the  distribution  of  the  nodes 
is  irrelevant.  Nevertheless,  experience  suggests  identifying  a  with  a  radial 
function 


a{x,y’,r)  =  (l){\\x  ~  yf  +  r), 
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where  ||  •  ||  is  a  convenient  norm.  As  an  example,  choosing  the  Euclidean  norm 

II  •  II2  and 

Ci{x,y\r)  (\\x  -  yWl+rf ,  /?  >  0,  (10) 

we  obtain  from  (8)  the  near-interpolant  Shepard’s  formula  (4), 

It  is  often  convenient  to  consider  the  function  :  IR>o  K,  as¬ 
sociated  with  (l>{\\x  —  ?/|p  +  r)  and  defined  as  ^{t\r)  =  +  r).  We  have 

just  seen  (p{t]  r)  =  (t^  +  r)^  in  (10);  another  possible  expression  that  works  is 
(p(t;r)  =  (t^  +  r)^exp(j  >  0).  Considering  several  functions  (p  can  be 

useful  in  order  to  compare  their  behaviour  and  choose  the  most  suitable  for 
use. 


§3.  Properties  of  Near-Interpolants 

Near-interpolants,  as  given  in  Definition  2,  enjoy  many  interesting  properties. 
We  list  some  of  them. 

A)  The  near-interpolant  F(x;r)  in  (8)  or  (9)  is  a  weighted  arithmetic  mean 

of  the  values  fj,  (j  =  l,...,n),  since  0  <  gj(x;r)  <  1  and  0  = 

1.  As  a  consequence  F(x;r)  satisfies  the  betweeness  property  min^ /j  < 
F(x;r)  <  maxi  fi ,  and  reproduces  exactly  any  constant  function  f(x)  =  c, 
that  is,  if  fi  =  c,  (i  =  then  F(x;r)  =  c.  Moreover,  F(x;r), 

considered  as  a  functional  on  the  set  of  functions  /  ;  D  IR,  is  linear  and 
positive. 

B)  If  a(x,y;r)  is  infinitely  differentiable  with  respect  to  the  pth  compo¬ 
nent  of  a;  =  ,  x^^^)  for  all  x^y  E  D  and  p  —  1, . . . ,  s,  then  F{x\r)  is 

also  infinitely  differentiable  with  respect  to  the  x^^\  For  example,  choosing 
a{x,y;r)  as  in  (10),  F(x;r)  =  iJ,{x]r)  in  (4)  can  be  differentiated  as  many 
times  as  desired. 

C)  If  a  is  a  radial  function,  F(x;  r)  enjoys  some  properties  of  invariance  with 
respect  to  affine  transformations.  In  particular,  with  the  Euclidean  norm,  we 
have  that  E(x;  r)  is  invariant  under  translation  and  rotation,  but  not  scalar 
invariant. 

D)  Subdivision  techniques  can  be  applied  to  near-interpolants  achieving  re¬ 
markable  results,  very  well  suited  for  parallel  computation  [3].  Let  us  make  a 
partition  of  the  set  Sn  on  the  domain  D  into  q  subsets  Snj ,  so  that  the  jth 
subset,  (j  =  1, . . . ,  g),  consists  of  the  nodes  Xji,Xj2, . .  • ,  Xjn^ ,  with  ni  -f  712  + 

- h  rig  =  n,  and  the  values  fjkj ,  (j  =  1, . . . ,  g;  fcj  =  1, . . . ,  rij),  correspond  to 

the  nodes  xjk^-  The  indexing  of  the  nodes  in  the  subsets  may  not  depend  on 
the  indexing  in  the  set,  provided  the  biunivocity  is  saved. 

Given  Snj  =  {xji^  •  •  •  J  ^jnj  }?  (j  !?•••?  ^ni  G  Sji2  U  •  *  '  U 

Snp  and  n  =  0  for  q  ^  r,  then  F{x\  r)  in  (9)  can  be  rewritten  in  the 
form 

FsA^;r)  =  VF5  ,(x;r)  ^  ^  ,  where  Aj  =  ^  l/a{x,Xjk,;r).  (11) 

j=i 
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E)  As  a  consequence  of  (11)  the  following  multistage  procedure  works  very 
well.  In  the  first  stage,  a  given  set  of  nodes  Sm  =  {xi^i  =  l,...,ni}  is 
considered,  and  the  corresponding  near-interpolant  Fs^^{x\r)  is  evaluated. 
In  the  second  stage,  it  is  required  to  enlarge  the  considered  set  taking 
the  union  of  it  and  another  set  of  nodes  Sn-2.  =  {xj^j  =  1,  •  •  • ,  ^2}.  Now  the 
near-interpolating  function  referred  to  the  union  set,  i.e., 

Snx  n  Sn^  =  0,  can  be  obtained  simply  by  evaluating  the  near-interpolant 
corresponding  to  the  added  set  5^2 ?  and  using  the  relation 


jx]  r)Ai  +  (a:;  r)A2 

Ai  -f  A2 


(12) 


where  Ai  =  l/Q;(a;,Xj;r),  A2  =  is  known. 

The  procedure  can  be  repeated  as  many  times  as  required. 


F)  Near-interpolants  have  the  remarkable  property  that  an  additional  node, 
say  a:„+i,  can  be  added  to  the  interpolation  set  Sn  by  simply  combining  an 
extra  term  with  the  original  formula.  The  goal  is  achieved  by  using  a  particular 
case  of  (12),  that  is,  the  recurrence  relation 

r.  /  ^  FsS^\r)AnA  fn+llloi{x,Xn^.l]r) 

+  -  ’ 

where  =  ^loL{x,Xk\r), 

G)  It  is  often  convenient  to  extend  (8),  or  better  (9),  in  the  following  way: 


F,{x-,r)  =  Y^Si 

3=1 


T{x,Xj\'))  lla(x,Xj]r) 
YJh=i'^i^^^h\l)  \la(x,Xh\r)  ’ 


(13) 


where  r(a:,2/;7),  with  x,y  e  D  and  7  >  0,  is  a  continuous  positive  real 
function.  Choosing  suitably  T{x^y]‘j),  one  can  modify  the  weights  in  (13) 
in  order  either  to  cancel  a  useless  characteristic,  or  to  introduce  a  new  fea¬ 
ture.  In  particular,  it  is  possible  to  localize  the  method  considering  a  factor 
r(a:,2/;7)  rapidly  decreasing  with  distance  [2].  The  formulas  obtained  in  this 
way  maintain,  in  general,  the  analytical  and  computational  properties  of  the 
corresponding  original  ones. 

The  use  of  the  exponential-type  function 

t{x,  y;  7)  =  exp(-7||a;  -  y\\l)  (14) 


is  suggested  by  McLain  [4]  for  Shepard’s  formula;  he  observes  that  much  more 
accurate  results  can  be  obtained  in  this  way.  The  use  of  exponential-type 
weights  increases  the  computational  effort,  but  generally  this  drawback  can 
be  tolerated. 

The  value  of  the  parameter  in  the  mollifying  function  r(x,y]'y)  may  de¬ 
pend  on  the  nodes,  as  happens  in  the  popular  case  [4] 

T{x,Xj-,Pj)=  (1-^^ - .  (15) 

\  pj  /  ^ 

where  pj  is  the  radius  of  the  circle  of  support  at  the  point  Xj,  and  (u)+  >  0 
if  u  >  0,  ('u)+  =  0  if  u  <  0. 
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H)  The  precision  of  the  operator  F(x;  r)  can  be  increased,  considering  the 
Taylor  expansion  for  the  function  /  in  each  node  xj  instead  of  the  function 
value  fj.  This  leads  to  the  following  extension  of  (9). 

If  /  G  C^{D)  and  Tj{x)  is  the  truncated  Taylor  expansion  for  /  up  to 
derivatives  of  order  m  evaluated  at  the  point  xj  and  referred  to  the  displace¬ 
ment  hj  =  X  —  Xj,  with  hj  C  D,  then  the  operator 

n 

F2{x-,T)  =  '£Tj{x)gi{x-,r),  (16) 

J  =  1 

near- interpolates  to  Tj(x)  at  x  =  xj.  In  this  form,  F2{x\  r)  reproduces  exactly 
algebraic  polynomials  of  degree  <  m. 

Combining  the  modifications  in  (13)  and  (16),  we  have 


F3(x-,T)  =  YlTj[x) 
j=l 


T(x,Xj;^)  l/a{x,xj;r) 

l/a{x,xh’,r)  ‘ 


(17) 


Obviously,  the  technique  calls  for  additional  derivative  values  that  are  not 
normally  available  as  data.  A  more  practical  solution  is  discussed  below. 

K)  For  simplicity,  we  refer  here  to  an  Euclidean  radial  function  q:(x,  y\ r)  = 
(/>(||a^  —  2/II2  +  '^)5  because  in  this  case  the  procedure  is  well  established.  The 
primary  modifications  required  involve  using  t{x,  y;  7)  to  localize  the  overall 
approximation,  and  replacing  fj  with  a  suitable  “local  approximation”  to 
the  surface.  To  carry  out  the  approximation  (17),  a  practical  way  is  to  get, 
in  a  first  stage,  local  approximants  Mj(x)  to  f(x)  at  the  points  Xj,{j  = 
l,...,7i),  obtained  by  means  of  the  moving  weighted  least-squares  method 
using  weight  functions  with  reduced  compact  support.  Then,  in  a  second 
stage,  the  near-interpolating  operator  is  expressed  as  a  convex  combination  of 
the  local  approximants 


n 


FA{x',r)  =  Y, 

j=i 


r{x,Xj',-f)  l/HWx-XjWl  +  r) 

IMII^  -  +  ^)  ' 


(18) 


In  particular,  by  (18),  (10),  and  (14), 


n 

i=i 


Gxp{-j\\x  ~  XjWl)  {\\x  -  XjWl  F  r)  ^ 

ELi exp(-7lk  -  xhWl)  (11^  -  ^hWl  +  ’ 


(19) 


which  extends  (5). 

Very  good  performance  is  achieved  by  a  version  of  (18)  which  uses  quad¬ 
ratic  approximations  for  Mj{x),  and  mollifying  functions  given  by  (15).  This 
method  has  been  developed  by  Franke  and  Nielson  [4],  and  Renka  [7]  for 
Shepard’s  operator. 
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§4.  Determining  Parameter  Values 

The  near-interpolating  operator  (19)  works  very  well  in  a  large  variety  of 
cases.  Our  attention  is  here  focused  on  finding  values  of  the  parameters  r,  ^ 
and  7,  which  can  be  regarded  as  “optimal”  from  a  practical  viewpoint.  The 
considerations  which  follow  are  mainly  based  on  experiments. 

A  relatively  small  increase  of  the  parameter  r  in  (19)  in  some  right  neigh¬ 
bourhood  of  zero  has  a  considerable  effect  on  the  behaviour  of  r).  In  fact, 
if  r  is  small,  x  fixed  and  near  to  the  node  Xj* ,  then  the  value  of  gj*  {x\  r)  equals 
nearly  one;  but,  if  r  increases,  gj*{x;r)  decreases.  Since  SJ-i  ^^(aJir)  =  1, 
diminishing  of  gj*{x]r)  makes  the  other  weight  values  gj{x’,r),  j  ^  j*,  in¬ 
crease.  Summing  up,  if  the  weight  attributed  to  fj*  in  fii{x;r)  decreases, 
then  fii{xj*;r)  =  fj*  +  diverges  from  fj*,  namely  ej*{r)  increases  and 

reduces  the  accuracy  of  fxi{xj*’,r). 

Introducing  the  parameter  r  in  (19),  and  in  particular  in  (5),  has  the 
effect  that,  in  general,  the  gradient  of  the  rendered  surface  is  not  zero  at  the 
nodes.  As  a  consequence,  the  surface  is  considerably  smoother  than  for  r  =  0. 
However,  if  r  is  too  small,  the  first  derivatives  of  pLi(x',  r)  are  highly  oscillating 
and  their  values  are  nearly  zero.  Clearly,  the  goal  is  to  choose  an  “optimal” 
value  of  r,  such  that  /j,i(x;r)  does  not  exhibit  the  characteristic  irregularities 
of  the  basic  Shepard’s  formula,  but  at  the  same  time,  it  maintains  a  sufficient 
computational  accuracy,  in  particular  at  the  nodes. 

The  search  for  the  optimal  value  of  r  can  be  done  by  many  applications 
of  (19)  with  different  values  of  the  parameter,  and  then  by  choosing  that  value 
which  minimizes  the  global  root  mean  square  error.  Although  this  is  currently 
considered  in  the  literature,  the  estimate  of  r  is  not  a  simple  matter;  in  a 
sense,  it  can  be  compared  with  the  analogous  difficult  problem  of  computing 
the  optimal  value  of  the  parameter  in  multiquadric  interpolants. 

The  optimal  value  of  the  parameter  /?  has  been  determined  with  particular 
attention  to  computational  accuracy.  The  performance  analysis  on  some  test 
functions  proposed  by  Franke  leads  to  prefer  the  value  /?  =  3/2. 

As  for  the  optimal  value  of  the  parameter  7  in  the  strongly  localizing 
function  (14),  McLain  has  proposed  7  =  1.62n/diam(£>),  where  n  is  the  num¬ 
ber  of  nodes  and  diam(£))  is  the  diameter  of  D.  However,  this  value  is,  in 
general,  too  large,  whereas  it  is  sufficient  to  consider  for  7  a  value  of  the  order 
of  tens. 


§5.  Application  to  Modelling  Faults 

Using  the  near-interpolating  operator  /xi(x;r)  of  (19),  with  a  suitable  value  of 
the  parameter  r,  instead  of  the  corresponding  interpolating  operator  /j,i(x;  0), 
increases  considerably  the  performance  of  the  approximation  in  a  rich  variety 
of  applications,  because  it  permits  consideration  of  supplementary  information 
connected  with  the  characteristics  of  the  examined  problem.  A  typical  case 
occurs  with  surface  discontinuities,  in  particular  faults,  which  are  frequently 
met  when  modelling  geological  surfaces. 
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Following  Shepard  [8],  we  observe  that,  if  some  physical  barrier  such  as  a 
fault  separates  the  set  of  nodes,  the  relationship  between  nodes  on  the  opposite 
sides  of  the  barrier  may  be  attenuated.  Through  the  inclusion  of  barriers,  a 
user  may  specify  discontinuities  in  the  metric  space  in  which  the  distance 
between  two  points  is  calculated  to  simulate  this  attenuation.  Suppose  a 
“detour”  of  length  b{x^Xk)  were  required  to  go  over  the  barrier  between  x, 
the  current  near-interpolation  point,  and  the  node  Xk-  The  quantity  b{x,Xk) 
is  considered  the  strength  of  the  barrier,  and  an  effective  distance  between  x 
and  Xk  is  given  by 


d*{x,Xk)  =  \/ld{x,Xk)]'^  -f-  [b{x,XkW. 

This  definition  is  general  so  that  if  no  barrier  separates  x  and  x/t,  then 
6(x,  Xk)  =  0  and  d*{x,Xk)  =  d{x^  Xk).  Because  of  the  discontinuity  in  effective 
distance  as  the  near-interpolation  point  x  crosses  the  barrier,  the  rendered 
surface  will  be  discontinuous  at  the  barrier. 

Since  extensive  tests  [4]  have  shown  that  the  modified  quadratic  Shepard’s 
method  performs  very  well  for  a  variety  of  data  sets,  Franke  and  Nielson  [5] 
have  chosen  it  as  a  basis  to  investigate  the  problem  of  simulating  faults.  Our 
approach  uses  instead  the  near-interpolant  (19),  with  significant  differences  in 
distance  penalty,  localizing  functions,  fault  forms,  etc.,  as  compared  to  Franke 
and  Nielson. 

The  possibility  of  having  to  model  faults  can  occur  in  different  ways  [5]; 
to  save  space,  we  limit  our  attention  to  the  following  case:  there  is  a  known 
fault  line  F  C  D  C  in  a  known  location,  with  a  known  jump.  More 
complicated  situations  (see,  e.g.,  [5,6])  require  extensive  considerations  that 
will  be  discussed  in  a  further  work. 

As  a  first  step,  it  is  convenient  to  focus  on  the  basic  situation  in  which 
the  fault  line  F  is  a  segment  I  and,  moreover,  the  jump  is  constant  along  1. 
Then,  a  known  polygonal  curve  can  be  considered  as  a  fault  line;  in  fact,  the 
reduction  to  the  case  of  a  fault  line  segment  is  straightforward  by  using  the 
subdivision  procedure  considered  in  Section  3.  In  principle,  any  curve  can  be 
considered  as  a  fault  line,  provided  it  is  well  approximated  by  a  polygonal. 
Another  extension  consists  in  considering  a  jump  varying  along  the  fault  line. 
Also  the  reduction  to  the  basic  case  is  now  possible,  subdividing  the  fault  line 
into  a  convenient  number  of  segments,  and  using  a  mean  value  of  the  jump 
for  each  segment. 

To  deal  with  the  basic  case,  we  modify  the  value  of  the  parameter  r  in 
(19)  in  order  to  take  the  jump  into  account.  Let  x  be  the  near-interpolation 
point,  Xk  a  node  and  I*  the  segment  joining  x  and  Xk-  Then  for  x,Xk  ^  I  we 
S0t 

^  _  f  ropt ,  if  /  n  r  =  0, 
t6(x,Xfc),  if 

where  the  quantity  Xopt  is  the  optimal  value  obtained  for  r  in  (19)  on  the 
opposite  sides  of  the  fault  and  b{x,Xk)  represents  the  “effort”  required  to  go 
over  the  barrier,  due  to  the  discontinuity  dividing  the  two  points.  If  the  jump 


Fig.  1.  Function  /i(x,y):  near-interpolation  and  signed  error  surfaces. 


Fig.  2.  Function  f2{x,y):  near-interpolation  and  signed  error  surfaces. 


is  constant  or  almost  constant  along  Z,  it  is  possible  to  simply  set  b{x,  Xk)  =  h, 
where  h  is  the  jump  size. 

Formula  (19),  after  these  adjustments  in  the  parameter  r,  gives  results 
quite  good  both  for  the  appearance  of  the  graphic  representation  and  the  accu¬ 
racy  in  computation.  Comparing  the  rendered  surface  with  the  one  obtained 
by  the  modified  quadratic  Shepard’s  formula  shows  that  the  introduction  of 
the  parameter  r  gives  a  smoother  surface  which  is  closer  to  the  approximated 
function. 

Our  procedure  has  been  used  to  fit  the  test  function  proposed  by  Pranke 
and  Nielson  [5]  using  their  set  of  nodes.  Numerous  tests  were  also  made  on 
other  surfaces.  We  present  two  examples  of  the  rendered  surfaces  and  the 
signed  error  surfaces  for  the  functions 


0.3,  if0<ar<0.5, 
0.7,  ifar>0.5; 


f2{x,y) 


y  -  x-{-l,  if0<a:<  0.5, 

0,  if  0.5  <  a;  <  0.6, 

0.3,  if  a?  >0.6, 
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defined  on  the  unit  square  (see  Fig.  1  and  Fig.  2).  We  used  the  parameter 
values  r  =  0.0036, =  1-5,7  24,  and  r  =  0.0025, =  1.5,7  =  30  respec¬ 

tively,  and  once  again  the  set  of  nodes  of  Franke  and  Nielson.  The  errors  can 
be  considerably  reduced  by  adding  more  information  on  the  faults;  in  fact, 
the  employed  set  of  nodes  is  not  obviously  an  ad  hoc  choice. 
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On  a  Method  of  Numerical  Differentiation 


Mira  Bozzini  and  Milvia  Rossini 


Abstract.  In  this  paper  we  present  a  method  for  the  numerical  differen¬ 
tiation  of  two-dimensional  functions  when  scattered  data  are  given.  The 
method  is  based  on  a  regularization  of  the  given  sample. 


§1.  Introduction 

In  this  paper  we  present  a  method  for  the  numerical  differentiation  of  two- 
dimensional  functions  when  scattered  data  are  given.  The  problem  of  numer¬ 
ical  differentiation  is  very  important  when  dealing  with  function  approxima¬ 
tion.  In  fact,  a  satisfactory  recovery  of  a  function  given  in  a  sampled  form 
needs  some  knowledge  of  the  derivatives. 

It  is  also  well  known  that  this  problem  is  ill-conditioned.  Consider  for 
instance,  one-dimensional  equispaced  data  with  hi  =  10^  and  /12  =2^,  On  a 
computer,  because  of  the  base  change,  numerical  differentiation  gives  consid¬ 
erable  errors  in  the  first  case,  and  a  more  accurate  solution  in  the  second  case. 
Moreover,  it  is  strongly  influenced  by  the  data  position. 

The  literature  on  scattered  data,  includes  the  papers  [7,8,10],  their  im¬ 
provements  [3,5],  and  some  experiments  on  their  use  [9].  These  papers  provide 
the  gradient  approximation  at  the  sampled  points  (see  [7,10])  or  the  approxi¬ 
mation  of  the  gradient  function  (see  [8]).  This  is  done  by  triangulation  or  local 
and  global  moving  least  square  interpolation.  In  some  of  them,  asymptotic 
bounds  for  the  error  are  also  supplied. 

Generally,  these  methods  provide  a  satisfactory  approximation  inside  the 
domain  in  which  the  data  are  given,  but  they  may  give  large  errors  at  the 
boundary  (see  Figs.  1-4  below). 

In  this  paper  we  present  a  method  based  on  a  regularization  of  the  sample 
which  gives  an  error  with  an  uniform  behaviour  on  the  whole  domain  in  which 
the  data  are  assigned.  The  regularization  is  done  by  constructing  a  new  set 
on  a  regular  grid.  Namely,  taking  into  account  the  previous  observations,  we 
have  considered  dyadic  grids.  Obviously  the  construction  of  this  new  set  will 
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generate  errors  that  can  be  thought  of  as  causal  errors  depending  on  the  sam¬ 
ple.  Then  (see  [4]),  to  smooth  the  data,  we  perform  a  wavelet  decomposition 
of  the  signal.  Using  the  smoothed  data,  we  approximate  the  gradient  at  the 
grid  points  by  the  classical  finite  centered  difference  formulas,  and  finally  we 
construct  a  smooth  function  approximating  the  unknown  gradient. 

The  paper  is  organised  as  follows.  In  §2  the  method  is  described  and 
the  gradient  estimator  is  constructed.  In  §3  the  convergence  properties  are 
considered.  Finally  in  §4  we  discuss  some  questions  related  to  the  numerical 
aspects  of  the  problem  and  we  provide  some  numerical  examples. 

§2.  The  Method 

Suppose  we  are  given  a  set  of  scattered  points  in  a  domain  Q  C 
S^{P:(xi,yi)\P:  eQ,i  =  h...,N}, 
and  the  set  of  functional  data 

F  =  {(P*,/(i^*)),t  =  l,...,iV}, 

where  f{x^y)  is  an  unknown  function  defined  on  Q.  Without  loss  of  generality, 
we  suppose  Q  =  [0, 1]  x  [0, 1]. 

The  first  step  of  our  method  consists  in  generating,  from  F,  a  new  set  of 
functional  values,  say  F,  located  on  a  dyadic  lattice  T  of  Q, 

T  =  {a,  k  =  (fci,  fc2)  €  2Z^  k  =  1, . . .  ,2"}, 

that  is 

F  =  m,m)),  a,€T}, 

It  is  clear  that  the  new  values  are  affected  by  errors 

M)  =  /(^*)  +  e(P»). 

Therefore  we  need  to  construct  F  by  an  efficient  computational  method  such 
that  the  error  is  less  or  of  the  same  order  as  that  generated  by  the  derivative 
approximation  we  will  use.  A  possible  strategy  is  to  interpolate  the  data  by  a 
local  method  of  a  suitable  order  m,m  <  a  (for  instance  a  moving  least  squares 
technique,  [6,  8])  which  gives  a  smooth  interpolating  function  f{x,  y)  G  C^{Q) 
with  an  error  e(x,y)  =  0(/i’^),  where  h  is  a  local  parameter  depending  on  the 
distribution  of  the  points  P^  in  Q  (usually  h  =  1/\/N). 

The  new  set  F  can  be  thought  of  as  a  sample  coming  from  a  stochastic 
process  depending  on  the  points  P^  G  S.  Then  we  can  use  the  method  de¬ 
scribed  in  [4]  for  noisy  data.  In  the  next  section,  we  will  perform  a  wavelet 
decomposition  in  order  to  smooth  the  errors  e,  and  we  will  define  an  estimator 
5j(n)(^5  2/)  of  fhe  underlying  function  f{x,y).  Then  we  will  approximate  the 
unknown  gradient  using  the  function  estimator  ?/),  and  the  classical 

approach  of  central  finite  differences. 
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In  this  paper  we  assume  that 

i)  y)  belongs  to  an  Holder  space  of  order  a  >  2onQ*  D  say  C°‘{Q*). 

ii)  We  consider  a  multiresolution  analysis  of  given  by  the  tensor 

product  of  two  one-dimensional  s-regular  multiresolution  analysis  such 
that  the  scaling  function  0  is  a  coiflet,  that  is  a  compactly  supported 
orthonormal  function  such  that  the  scaling  function  0(x)  and  the  wavelet 
iplx)  have  L  —  1  and  L,  vanishing  moments  respectively.  Moreover,  we 
assume  that  L>  [a] +  1  and  s  >  a. 

2.1.  The  function  estimator 

As  mentioned  above,  for  approximating  the  gradient  of  f{x,y),  we  will  use 
the  function  gj(^n'^(x,  y)  defined  as  in  [4].  In  this  section  we  briefly  describe  its 
definition  and  the  motivations  that  lead  to  consider  this  function. 

It  is  known  that  when  we  perform  a  MRA  using  data  given  on  a  subset  of 
,  we  need  to  take  into  account  the  problem  of  reducing  the  boundary  errors. 
To  this  end,  we  consider  a  function  g{x,y)  6  (7“(]R^)  compactly  supported 
on  Q*  D  Q  such  that 


g(x,y)  =  f(x,y),  V(x,y)  e  Q, 

and  a  new  dyadic  lattice  T*  on  Q*  of  dimension  2"  x  2”,  such  that  T*  = 
{T  U  {  points  sampled  in  Q*  \  Q}}.  Moreover,  the  advantage  of  the  nested 
structure  of  a  MRA  is  that  to  provide  an  efficient  tree-structure  algorithm 
for  the  decomposition  of  functions  in  Vn  for  which  the  smoothing  coefficients 
(p,  ^n,k)  are  given. 

In  applications,  a  function  is  given  in  sampled  form,  and  it  is  therefore 
necessary  to  approximate  the  projection  on  the  space  Vn,  by  some  oper¬ 
ator  Hn,  and  to  derive  a  reasonable  estimator  of  n„  in  terms  of  the  sampled 
values.  The  choice  of  n„  and  of  its  estimator  is  suggested  by  the  following 
facts  (see  [1,4]). 

The  set  of  nonzero  coefficients  (5,  $n,fc}  bas  cardinality  equivalent  to 
0(2^”).  Moreover, 

<C2-"2-"“,  (1) 

where  C  is  a  constant  depending  on  the  smoothing  function  ^{x,y)  and  on 
9{x,y)- 

As  a  consequence,  we  define 

{Ung){x,y)  =  2-^  9{Pk)^n,k{x,y).  (2) 

Pf^eT* 

Since  we  have  data  corrupted  by  the  interpolation  errors,  we  consider  the 
estimator  of  n„ 

(n„s)(^:,y)  =  2-"{^  /(Pfc)4.„,fc(x,y)+  Y.  ff(^k)$„.k(x,y)}.  (3) 

Pjfe€T  Pk€T*\T 
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This  estimator  may  lead  to  an  oscillatory  solution  bearing  too  much  fidelity  to 
the  data.  Since,  for  numerical  differentiation,  we  need  to  correctly  smooth  the 
data,  we  have  to  associate  to  each  sample  of  size  2"  x  2"  a  resolution  j(n)  <  n, 
and  to  consider  the  orthogonal  projection  of  {tlng){x,y)  onto  V'j(fi),  that  is 

9j{n){^,y)  = 

The  parameter  j{n)  governs  the  smoothness  of  our  estimator,  and  it  is 
important  to  choose  it  in  the  right  way  because  it  controls  the  tradeoff  between 
the  fidelity  to  the  data  and  the  smoothness  of  the  resulting  solution.  From  a 
theoretical  point  of  view,  the  smoothing  parameter  must  tend  to  infinity  at 
the  correct  rate,  as  the  amount  of  information  in  the  data  grows  to  infinity. 

2.2.  The  gradient  estimator 

We  now  consider  the  construction  of  the  gradient  estimator.  When  dealing 
with  gridded  data,  it  is  natural  to  approximate  the  gradient  using  the  usual 
centered  difference  formulas.  Let 

{gradf){x,y)  =  f^y\x,y)^  , 

be  the  gradient  of  f{x,y),  and  let  be  the  centered  difference  operators 

which  use  r  equispaced  points  in  the  x  or  y  direction  respectively.  If  r  <  [a], 
we  know  that 


(Z)‘/)(Pt)  =  f(‘>(Pk)  +  t  =  {x,  y}. 

Then,  at  each  point  of  the  lattice  T,  we  approximate  (grad  f)(Pk)  by 

{gTadf){Py,)  =  W5i(.))(a)) .  (5) 

Using  the  data  (5),  it  is  possible  to  define,  at  each  point  of  Q,  an  estimator  of 
f^^\x,y),  t  =  {x,y},  by  the  operator  Hn.  Namely,  we  consider  the  function 
g{xjy),  and  we  define 

{gTadf){x,y)  =  ((n„g<">)(x,j/),(n„g(»))(i,y))  ,  (6) 


where 


(nn9<‘>)(x.i,)  =  2-"{  (Djg,(^,)(a)4-„,k(x,j/) 

+  E  g''’(a)4'„,k(x,y)}. 

PkeT*\T 
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§3.  Asymptotic  Properties 

Under  the  assumptions  of  Section  2,  and  taking  into  account  the  properties 
of  the  projection  Pvj »  we  know  that 

\(Pv.g){x,y)-g{x,y)\  =  0{2-‘‘^),  'i(x,y)eQ\  (7) 

Moreover,  from  (1)  we  have 

|(n(s)(x,3,)-(Pv,g)(r<;,y)|  =  0(2-'“)  v(x,g)6  0*,  (8) 

then 

|(ni5f)(x, y)  -  f{x, y)\  =  0(2"^“)  V(x,  y)  €  Q.  (9) 

Using  relations  (7),  (8),  (9)  and  the  results  stated  in  [4],  we  have  proved 
Proposition  1.  If  assumptions  i)  and  ii)  of  Section  2  hold,  we  have 

Mx,y'l  -  f{x,y)\  =  0(2-’W“)  + 
for  every  (x,  ?/)  G  Q. 

Remark  1.  This  result  points  out  how  the  choice  of  j{h)  depends  on  the 
sample  dimension  N  and  on  m.  In  fact  it  has  to  be  chosen  so  that  is 

less  or  of  the  same  order  of  h^. 

Proposition  2.  If  assumptions  i),  ii)  of  Section  2  hold,  the  approximation 
(6)  of  the  gradient  satisfies,  asymptotically,  the  following  bound 

l(n„g(‘))(x,y)  -  =0(2-^(*>“)  +  0(2-”<’-i>) 


for  every  (x,  y)  €  Q. 


§4.  Numerical  Results 

In  this  section,  we  discuss  some  questions  related  to  the  computational  costs 
and  to  the  numerical  implementation  of  the  method  we  have  studied.  We  also 
present  some  numerical  results. 

4.1.  Computational  costs 

The  computational  costs  are  essentially  given  by  the  wavelet  decomposition 
and  by  the  construction  of  the  gridded  data  set  F  of  dimension  2^”^.  For 
the  wavelet  decomposition,  they  are  at  the  most  of  the  same  order  of  the 
sample  dimension,  that  is  0(2^").  For  the  construction  of  F,  if  we  use  a  local 
method  of  order  m  <  A,  they  are  given  by  the  solution  of  N  linear  systems 
of  dimension  2m  +  1.  Then  the  computational  costs  will  be  + 

0(22”). 
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4.2.  Numerical  implementation 

In  this  section  we  discuss  some  questions  related  to  the  construction  of 
gj(^n^{x,y)  and  .  Following  the  idea  of  §3.1,  we  need  to  extend 

the  given  signal  to  a  suitable  square  Q*  D  Q 

Q*  =  1  +  2~^K\  X  [-2-^K,  1  +  2-^Ki 


forcing  it  to  be  zero  at  the  Q*  boundary.  This  is  necessary  in  order  to  avoid 
undesirable  behaviour  at  the  boundary.  The  extension  can  be  done  following 
a  method  proposed  in  [2].  Note  that  K  is  related  to  the  number  of  points  we 
consider  outside  Q.  On  one  hand,  it  cannot  be  chosen  too  small,  otherwise 
undesirable  boundary  oscillations  could  occur.  On  the  other  hand,  it  depends 
on  the  resolution  level  j(n).  In  fact,  it  has  to  be  chosen  so  that  the  discrete 
wavelet  decomposition  can  be  performed. 

In  the  numerical  examples  we  have  used  the  coiflets  with  L  =  6  vanishing 
moments.  For  the  pointwise  gradient  approximation,  we  have  used  the  central 
finite  difference  of  order  4  (r  =  5). 


4.3.  Numerical  examples 

In  this  paper  we  present  the  results  achieved  for  the  test  functions 


fi(x,y)  =  0.75exp|-((93:  -  2f  +  (9j/  -2)^)/4] 

+  0.75exp[-((9x  +  1)749  +  (9y  +  l)7l0)] 
+  0.5exp[-((9x  -  +  (9y  -  3)^)/4] 

-  0.2exp[-((92:  -  4)^  +  (9y  -  7)^)], 

f2(x,y)  =  ,  =. 

7(1  +  2exp(-27l00x2  + 1001/2  -  6.7) 


defined  on  the  unit  square  [0, 1]  x  [0, 1].  We  have  considered  N  scattered  points 
on  Q,  and  have  constructed  a  new  gridded  data  set  F  of  size  2”  x  2^  using 
the  modified  quadratic  Shepard  method.  The  smoothing  parameter  j(n)  has 
been  chosen  taking  into  account  Remark  1  of  Section  3.  Therefore,  having 
used  the  modified  quadratic  Shepard  method  with  n  =  5,  a  possible  choice  is 
j(n)  =  4. 

We  now  present  our  results,  and  compare  them  with  those  obtained  with 
the  method  (L-method)  proposed  in  [8]  which,  among  those  we  find  in  the 
literature,  we  belive  is  preferable  both  for  its  theoretical  aspects  and  numerical 
performance. 

The  following  examples  show  how  the  proposed  method  provides  an  ap¬ 
proximation  which  seems  to  have  the  same  behaviour  for  the  functions  con¬ 
sidered,  both  for  graphical  results  and  for  errors.  For  the  sake  of  brevity,  we 
show  only  the  approximations  of  one  gradient  component,  but  give  the  relative 
errors  for  both  of  them. 
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Fig.  1.  Example  1,  N  =  100.  On  the  left-hand  side,  the  result  of  our  method. 
On  the  right-hand  side,  the  result  of  the  L-method. 


Fig.  2.  Example  1.  jV  =  300.  On  the  left-hand  side,  the  results  of  our  method. 
On  the  right-hand  side,  the  result  of  the  L-method. 


Example  1.  Consider  N  —  100  and  N  =  300  values  of  fi{x^y).  In  Figs.l 
and  2  we  present  the  y-partial  derivative  approximations.  The  following  table 
lists  er:=  relative  error  of  our  method  and  erL:=  relative  error  of  the  L-method: 

/i  AT  ^  100  N  =  300 
er  erL  er  erL 
r  24%  55%  6%  12% 

fy  43%  226%  36%  36% 
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Fig.  3.  Example  2.  N  =  100.  On  the  left-hand  side,  the  result  of  our  method. 
On  the  right-hand  side,  the  result  of  the  L-method. 


Fig.  4.  Example  2.  N  =  300.  On  the  left-hand  side,  the  result  of  our  method. 
On  the  right-hand  side,  the  result  of  the  L-method. 


Example  2.  Consider  N  =  100  and  N  =  300  values  of  f2{x,y).  In  Figs.  3 
and  4  we  present  the  results  achieved  for  the  y-partial  derivative,  and  in  Fig.  5 
the  error  functions  for  N  =  300.  The  following  table  lists  er:=  relative  error 
of  our  method,  and  erL:=  relative  error  of  the  L-method: 


/2  =  100  N  =  300 

er  erL  er  erL 
35%  400%  6%  54% 

27%  64%  6%  135% 


r 

P 
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Fig.  5.  Example  2.  N  =  300.  On  the  left-hand  side,  the  error  function  of  our 
method.  On  the  right-hand  side,  the  error  function  of  L-method. 


0  0 


Fig.  6.  Example  2.  N  =  300.  The  approximation  of  f2{x,y).  The  absolute 
maximum  error  is  0.015. 

Finally,  as  is  usual  in  the  literature,  we  consider  an  application  to  function 
recovery  which  shows  the  goodness  of  the  gradient  approximation.  We  recover 
f2{x,y)  by  Hermite  interpolation  at  16  x  16  nodes,  where  we  interpolate  the 
data  coming  from  the  estimator  gj(^n>j{x,y)  and  from  the  approximated  gradi¬ 
ent  (Fig.  6). 
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Ridgelets  and  their  Derivatives: 
Representation  of  Images  with  Edges 


Emmanuel  J.  Candes 


Abstract.  This  paper  reviews  the  development  of  several  recent  tools 
from  computational  harmonic  analysis.  These  new  systems  are  presented 
under  a  coherent  perspective,  namely,  the  representation  of  bivariate  func¬ 
tions  that  are  singular  along  smooth  curves  (edges).  First,  the  represen¬ 
tation  of  functions  that  are  smooth  away  from  straight  edges  is  presented, 
and  ridgelets  will  be  shown  to  provide  near  optimal  nonlinear  approxi¬ 
mations  to  these  objects.  Motivated  by  the  limitations  of  the  ridgelet 
methodology,  new  representation  systems,  namely,  monoscale  ridgelets 
and  curvelets  -  both  of  which  use  the  ridgelet  transform  as  a  building 
block  -  will  be  introduced.  Curvelets  are  shown  to  provide  concrete  and 
constructive  optimal  nonlinear  approximations  to  smooth  functions  with 
twice  differentiable  singularities.  In  addition,  these  approximations  are 
obtained  simply  by  thresholding  the  curvelet  series. 


§1.  Introduction 

Throughout  the  sciences,  sparse  representations  of  classes  of  objects  are  of¬ 
ten  sought  because  of  the  well-known  applications  of  sparsity  to  problems 
ranging  from  data  compression  and  statistical  estimation  to  feature  detec¬ 
tion.  Indeed,  finding  sparse  representations  together  with  rapid  algorithms  to 
compute  them  is  one  of  the  main  objectives  of  a  rapidly  growing  field,  com¬ 
putational  harmonic  analysis  (CHA).  In  this  paper,  we  will  argue  that  CHA 
has  not  really  addressed  the  problem  of  efficiently  representing  smooth  mul¬ 
tivariate  functions  with  sharp  discontinuities,  like  smooth  images  with  edges. 
Motivated  by  this  gap  in  the  literature,  we  present  a  collection  of  new  rep¬ 
resentation  tools  that  efficiently  represent  smooth  functions  that  are  singular 
along  curves.  Here,  the  tone  is  expository;  details  may  be  found  in  the  cited 
references.  In  this  paper,  attention  is  restricted  to  the  two-dimensional  situa¬ 
tions  although  extensions  to  higher  dimensions  exist,  or  are  anticipated. 
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The  wavelet  miracle 

One  of  the  most  appealing  features  of  wavelet  systems  is  their  ability  to  provide 
efficient  representations  of  spatially  inhomogeneous  functions,  i.e.,  functions 
that  may  be  discontinuous,  spiky,  etc.  In  Mallat’s  words  “bases  of  smooth 
wavelets  are  the  best  bases  for  representing  objects  composed  of  singularities, 
when  there  may  be  an  arbitrary  number  of  singularities,  which  may  be  located 
in  all  possible  spatial  positions”  [8].  For  instance,  on  the  unit  interval  define 

m  =  H{t-to)g{t),  iG[0,l],  (1) 

where  H  is  the  Heavyside  H{t)  =  l{t>o}  and  p  is  a  smooth  arbitrary  function 
with  compact  support  and  finite  Sobolev  norm  ||p||w|  (see  [1]  for  the  classical 
definition  of  L2  Sobolev  norms).  Then,  the  number  of  Fourier  coefficients  of 
f  exceeding  1/n  in  absolute  value  is  bounded  below  by  c  •  n,  regardless  of 
the  degree  of  smoothness  of  /  away  from  the  singular  point  to-  This  means 
that  a  lot  of  different  terms  are  needed  to  obtain  good  partial  reconstructions; 
keeping  the  n  largest  terms  in  the  Fourier  series  gives  only  an  L2  error  of  ap¬ 
proximation  of  order  (Throughout  the  paper,  it  will  always  be  implicit 

that  the  error  is  measured  in  the  L2  norm.)  In  contrast,  the  sparsity  of  the 
wavelet  coefficient  sequence  of  /  is  in  some  sense  the  same  as  if  /  were  not  sin¬ 
gular.  In  effect,  the  number  of  wavelet  coefficients  exceeding  1/n  is  bounded 
by  giving  rates  of  approximation  of  order  n”^  corresponding  to 

the  nonlinear  bandwidth  of  W|  Sobolev  balls.  This  remarkable  adaptivity 
property  is  what  we  call  the  “wavelet  miracle.” 

The  curse 

Unfortunately,  wavelets  can  deal  with  point-like  singularities,  but  are  seriously 
challenged  by  line-like  singularities  in  dimension  two.  Let  us  for  instance 
consider  the  object 

f{xi,X2)  =  H(xicoseQ-Vx2smeQ-t(i)g[xi,X2),  (xi, X2)  €  [0, 1]^,  (2) 

where,  again,  ^  is  a  bivariate  function  taken  from  the  Sobolev  space  VF2 ;  /  is 
singular  on  the  line  Xi  cos  +  ^2  sin  =  ^o?  but  smooth  otherwise.  Then,  the 
number  of  wavelet  coefficients  exceeding  1/n  is  now  of  the  order  n.  Hence, 
partial  n-term  wavelet  reconstructions  will  only  converge  at  a  rate 
regardless  of  the  almost  everywhere  degree  s  of  smoothness.  The  edge  limits 
the  speed  of  convergence.  This  result  is  intuitively  not  very  surprising  as 
wavelet  bases  are  made  of  local  isotropic  oscillatory  bumps  at  various  scales, 
and  are  not  adapted  to  represent  long  elongated  structures  like  edges. 

This  clearly  raises  an  important  question:  in  two  dimensions  (and,  more 
generally,  in  arbitrary  d  dimensions)  can  we  develop  a  representation  enjoying 
the  same  adaptivity  features  as  wavelets  in  dimension  one? 
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§2.  Ridgelets  and  Linear  Singularities 

In  [3],  Candes  introduced  a  new  tiling  of  the  frequency  plane  that  led  to  the 
construction  of  ridgelet  frames.  We  say  that  a  collection  ((^„)  is  a  frame  of  a 
Hilbert  space  H  if  there  exist  two  constants  A,B>0  such  that  for  any  element 
of  H,  we  have 

n 

When  A  =  the  frame  is  said  to  be  tight.  A  collection  ((^„)  that  verifies 
the  frame  property  is  of  course  complete  and  there  is  a  very  concrete  way  to 
reconstruct  /  from  the  datum  of  its  coefficients  ((/,  V’n)if)-  Generalities  about 
frames  can  be  found  in  [11]. 

Let  be  a  univariate  oscillatory  function  and  'ipj,k{t)  =  —  k). 

The  ridgelet  frame  ipj,t,k  is  a  collection  of  ridge  functions  given  by 

^JAkiO  =  -  2^  2-^'^)  +  rPj,k{-\^\W  +  TT  -  27r  2-^) 

in  the  frequency  domain  [3]  {6  denotes  the  dirac  distribution). 

Donoho  [9]  modified  the  ridgelet  construction  by  essentially  replacing  the 
discretization  of  the  angular  variable  with  a  periodic  wavelet  transform  result¬ 
ing  in  an  orthonormal  basis.  He  called  these  new  basis  elements  orthonormal 
ridgelets.  In  the  remainder  of  this  paper,  we  make  the  choice  of  the  orthonor¬ 
mal  ridgelets,  although  all  the  results  and  constructions  that  follow  would 
hold  true  if  one  were  to  use  ‘pure  ridgelets.’ 

As  stated  in  [9],  such  a  system  can  be  defined  as  follows:  let  ('ipj,k{t))j,ke'z 
be  an  orthonormal  basis  of  Meyer  wavelets  for  L^(IR)  [12],  and  let 
0, . . . ,  2*°  —  1;  wj ^{9),  i  >  io,  ^  =  0, . . . ,  2*  —  1)  be  an  orthonormal  basis  for 
L^[0, 27r)  made  of  periodized  Lemarie  scaling  functions  at  level  io  and 
periodized  Meyer  wavelets  at  levels  i  >  io.  (We  suppose  a  particular 
normalization  of  these  functions.)  Let  denote  the  Fourier  transform 

of  'ipj,k(t),  and  define  ridgelets  pA(a^),  ^  =  Ujh‘i,i,e)  as  functions  of  a:  G 
using  the  frequency-domain  definition 

Mi)  =  +V’>,ifc(-|CI)<<(^  +  ’r))/2-  (3) 

Here  the  indices  run  as  follows:  A:  G  ZZ,  £  =  0, . . .  ,2*“^  —  1;  z  >  zq,  z  >  j. 
Notice  the  restrictions  on  the  range  of  £  and  on  i.  Let  A  denote  the  set  of  all 
such  indices  A.  It  turns  out  that  (pA)AeA  ^  complete  orthonormal  system 
for  £2(]r2).  Hence,  we  have  a  new  decomposition  of  the  form 

/  =  '^{f,Px)px- 

X 

Ridgelets  turn  out  to  be  optimal  for  representing  functions  with  linear 
singularities.  Indeed,  let  us  consider  the  template  (2).  The  following  theorem 
is  proved  in  [4]. 
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Theorem  1.  Let  g  G  Ty2*(I[l^)  and  f{xi,X2)  =  H{xicosOo  +  X2sm0o  - 
to)  g{xi,X2).  Then  the  sequence  (oa  =  (/jPa))  of  orthonormal  ridgelet  coeffi¬ 
cients  of  f  satisfies 


#{|aA|  >  IM  <  ||£?||vv- 

for  some  constant  C  not  depending  on  f.  As  a  consequence,  the  n-term 
approximation  fn  -  obtained  by  keeping  the  terms  corresponding  to  the  n 
largest  coefficients  in  the  ridgelet  expansion  -  satisfies 

\\f-f4<Cn-‘/^ 

Hence,  the  theorem  states  that  we  obtain  a  rate  of  approximation  as  if 
the  object  were  not  singular,  simply  by  thresholding  the  orthonormal  ridgelet 
expansion.  Whereas  the  singularity  caused  partial  wavelet  reconstructions  to 
converge  very  slowly,  its  effect  on  the  approximation  rate  of  truncated  ridgelet 
series  is  ‘harmless.’ 


§3.  Ridgelets  and  Curved  Edges. 

Theorem  1  considered  linear  singularities  and  it  seems  natural  to  ask  whether 
similar  results  will  hold  if  one  replaces  the  singularity  along  a  straight  line 
with  one  along  an  arbitrary  curve  7.  To  simplify  our  exposition,  consider  the 
simple  case  of  a  singular  function  defined  on  the  unit  square  by 

f{xi,X2)  =  g{xi,X2)  1{x2<7(xi)}>  (4) 

where  p  is  a  smooth  function  and  7  is  smooth  curve.  Then  the  ridgelet  coef¬ 
ficient  sequence  of  such  an  object  is  in  general  not  sparse: 

#{A,  |a;A|  >  f/n}  >  cn. 

Thus,  the  speed  of  convergence  of  the  best  n-term  ridgelet  approximation  is 
only  of  order  It  is  interesting  to  observe  that  the  degree  of  approxima¬ 

tion  of  both  wavelet  and  ridgelet  partial  reconstructions  is  the  same,  although 
they  correspond  to  radically  different  systems  of  representation.  Ridgelets  are 
elongated  and  directional,  whereas  wavelets  are  isotropic  and  local. 

The  limitations  that  we  presented  in  this  section  motivate  the  refinements 
and  new  tools  that  we  are  about  to  introduce. 

§4.  Monoscale  Ridgelets 

The  approach  developed  in  this  section  builds  on  Theorem  1.  The  idea  here 
is  to  take  advantage  of  the  optimal  representation  of  linear  singularities  by 
localizing  the  ridgelets.  A  detailed  exposition  is  provided  in  [5]. 

For  an  integer  5  >  0  and  integers  ^1,^2,  we  let  Q  be  the  dyadic  square 
defined  by  Q  =  [kil2\[ki  -h  l)/2")  x  [k2/2^,{k2  +  l)/2^).  The  collection  of 
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all  dyadic  squares  at  scale  s  will  be  denoted  by  Qs.  The  idea  is  to  smoothly 
localize  the  function  /  we  wish  to  represent  near  each  of  the  dyadic  squares 
of  Qs.  We  choose  an  orthonormal  partition  of  unity  wq;  that  is,  a  collection 
of  windows  such  that  Wq  is  a  partition  of  unity 

E  ""0  = 

QeQs 

The  following  details  a  way  of  making  up  such  an  orthonormal  partition: 
take  a  C°°  univariate  window  z/  supported  in  [—3/4, 3/4]  such  that  ^{t)  —  1 
on  [—1/2, 1/2];  define  vq  —  v[2^xi  —  ki)v{2^X2  —  k2)\  and  renormalize  the 
windows  vq  with 

WQ  =  vqK 

Q€Q, 

It  is  then  clear  that  the  wq^s  obey  the  desired  condition. 

Define  the  rescaling  operator  Tqg  by 

TQg  =  2^g(2^xi  —  ki^2^X2  —  A:2), 

which  is  an  isometry  of  L2.  Throughout  this  section,  s  is  arbitrary  but  fixed. 
Monoscale  ridgelets  are  defined  as  follows:  let  px  be  an  orthonormal  ridgelet 
basis  and  define 


'IpQ, X(XI,X2)  =  WQ{xi,X2){TQPx)ixi,X2y, 


the  collection 

{i^QyXiQ  ^  Qs')^  ^  (fi) 

is  what  we  call  the  monoscale  ridgelet  dictionary. 

It  is  easy  to  check  that  the  monoscale  ridgelet  dictionary  is  a  tight  frame 
of  ^2(11^)  as  we  have  a  Parseval  relationship 

11/111=  E 

Q€Q.  A 

Standard  arguments  show  that  we  then  have  the  decomposition 

QEQa  A 

with  equality  holding  in  an  L2  sense. 

We  add  an  “extra  layer  of  coarse  scale  coefficients”  to  eliminate  various 
artifacts.  Consider  a  standard  multiresolution  analysis  that  is  adapted  to 
the  unit  square  [7]  so  that  the  set  of  translates  {2®y?(2®  •  —k)},  k  —  (^1,^2), 
fcj  =  0, 1, . . . ,  2^  —  1  is  orthonormal.  Let  Pq  be  the  orthogonal  projector  onto 
14,  the  span  of  the  (^s.fc’s;  i.e., 

Pof  ■=  ^{f,^s,k)^s,k  :=  '^/33,k^s,k- 

k  k 
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The  following  Pythagorean  relationship  holds: 

ll/llHll^’o/|l^  +  ll(/-i’o)/|li.  (7) 

Finally,  define  the  coefficients 

=  (^/)  V’Q.a)  =  QcQsjAgA.  (8) 

Definition  1.  The  monoscale  ridgelet  transform  with  base  scale  s  is  the  map¬ 
ping  from  functions  f  G  ^2(111^)  to  the  amalgamation  of  coefficients  {Ps,k)  and 
(Os,//)' 

Note  that  we  again  have  a  partial  isometry 

ii/ii2  =  Eia,*i^+EK/-i'’ 

k  n 

thanks  to  the  Pythagorean  relationship  (7). 

Let  us  return  now  to  the  main  theme  of  this  paper,  and  study  the  ef¬ 
ficiency  of  monoscale  ridgelets  to  represent  objects  that  are  singular  along 
curves.  Suppose  that  one  is  interested  in  constructing  an  n-term  approxima¬ 
tion  of  the  function  /  in  (4).  Without  loss  of  generality,  we  will  suppose  that 
n  is  of  the  form  n  =  2^*^+^  We  simply  expand  /  in  the  monoscale  ridgelet 
dictionary  (5)  with  s  =  J  as  a  choice  of  base  scale]  that  is,  we  define  the 
n-term  approximation  by 


/„  =  Po/  +  Rn/2f,  (9) 

where  Rn/2f  is  the  partial  reconstruction  of  the  residual  Rf  obtained  by 
keeping  the  terms  corresponding  to  the  n/2  =  2?^  largest  coefficients 

It  is  interesting  to  observe  that  the  choice  of  the  base  scale  s  of  the 
monoscale  dictionary  depends  on  the  number  n  of  terms  we  wish  to  keep  in 
the  approximant.  We  have  the  following  result  [5]: 

Theorem  2.  Let  g  G  IFKIR^)  and  f{x)  =  1{i2<7(xi)}j  with  7  being 

three  times  differentiable.  Let  fn  be  the  n-term  approximation  defined  by  (9). 
Then, 

11/  -  fnh  <  C  max(n"^/^n"^/'^). 

This  simple  approximation  scheme  provides  optimal  rates  of  convergence 
as  long  as  s  <  3/2;  that  is,  approximation  bounds  as  if  /  were  not  singular. 
In  some  sense,  one  is  allowed  to  say  that  unlike  wavelets,  ridgelets  can  be 
adapted  to  provide  efficient  representations  of  curved  singularities.  There  is 
a  critical  value  5  =  3/2  of  the  smoothness  parameter,  however,  beyond  which 
the  method  saturates;  as  s  increases,  the  approximation  rate  is  blocked  at 
n-3/4  Nevertheless,  this  represents  already  a  substantial  improvement  over 
wavelet  approximations  whose  convergence  rates  are  blocked  at 
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Better  results  are  theoretically  possible.  For  instance,  let  T{C)  be  a 
model  of  smooth  images  with  twice  differentiable  edges  defined  as  follows: 

=  {/  :  /  satisfies  (4)  with  \\g\\w^  <  C  and  ||7||(^2  <  C}. 

The  condition  ||7||(^2  <  C  states  that  the  homogeneous  Holder  norm  of  order 
2  is  bounded  by  C.  In  other  words,  7  is  differentiable  and  its  first  derivative 
satisfies  the  Lipschitz  condition  |7^('it)  —  7^('y)|  C\u  —  v{.  For  this  class 
of  objects,  it  can  be  shown  that  there  are  reasonable  ways  of  constructing 
approximations  converging  at  the  rate  n“^logn.  Monoscale  ridgelets  do  not 
attain  this  optimal  rate. 

§5.  Curvelets  and  Curved  Singularities 

The  curvelet  transform  -  introduced  by  Candes  and  Donoho  in  [6]  -  is  the 
last  of  the  representation  tools  that  we  will  review.  Whereas  the  monoscale 
ridgelet  transform  involved  taking  ridgelet  coefl&cients  with  a  fixed  base  scale 
5,  the  curvelet  transform  spans  all  possible  scales  s  >  0.  A  useful  slogan  is  that 
the  curvelet  transform  is  obtained  by  filtering  and  then  applying  a  multiscale 
ridgelet  transform.  The  multiscale  ridgelet  dictionary  is  the  collection  of  the 
monoscale  dictionaries  at  all  possible  scales  s  >  0;  i.e., 

W/x  —  5  >  0,  g  €  Qs,  A  G  A}.  (10) 

The  curvelet  transform  requires  the  use  of  a  sequence  of  filters  that  we 
now  describe.  Let  $0  and  ^2s}  s  =  0, 1, 2, . . .  satisfy  the  following  properties; 

•  $0  is  a  lowpass  filter  and  is  concentrated  at  frequencies  |^|  <  2; 

•  ^2s  is  bandpass  and  concentrated  at  frequencies  |^|  G  [2^®~^,  2^®"''®]; 

•  the  filters  satisfy 

a>0 

Existence  and  constructions  of  such  filters  are  well-known.  The  last  relation¬ 
ship  implies  that  the  transformation  of  /  into  a  bank  of  functions 

/  {Pof  =  ^0  *  /,  Ao/  =  ’4^0  *  />  Ai/  =  liri  ,  As/  =  ^^25  *  /,  • .  ■) 

is  a  partial  isometry  in  the  sense  that 

II/II2  =  11^^0/111  + 5]  ||A,* /Hi. 

«>o 


Equipped  with  both  a  multiscale  ridgelet  dictionary  and  a  sequence  of 
filters,  define  the  curvelet  coefficient  a^,  of  /  by 


=  (As/,  V^q.a),  Q6Qs,AgA. 


(11) 
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Thus,  the  coefficient  is  interpreted  as  the  multiscale  ridgelet  coefficient  of 
a  piece  of  /  containing  information  at  frequencies  near  We  would  like  to 
point  out  that  there  is  a  quadratic  scaling  relationship  between  the  scale  2®  of 
the  multiscale  ridgelet  and  the  frequency  content,  localized  around  the  corona 
of  radius  2^%  of  the  piece  that  is  analyzed.  This  relationship  is  the  key  feature 
of  the  curvelet  transform. 

We  proceed  a  little  bit  differently  for  the  piece  of  /  containing  information 
at  low  frequencies  -Pq/*  Recall  the  orthogonal  collection  of  Lemarie-Meyer 
scaling  functions  Vk{xi,X2)  =  V{xi  -  ki,X2  -  ^2))  for  k  =  (^1,^2)  G  We 
make  the  choice  of  a  base  scale  so  that  Fo(0  —  f  f*^**  make 

sure  that  the  span  of  the  translates  I4  contains  the  range  of  the  projector 
Po/-  We  define  the  coarse  scale  curvelet  coefficients  by 

Pk  =  {Pof,Vk),  ke2l\ 

It  will  be  more  convenient  to  use  a  single  notation  to  index  the  set  of 
curvelet  coefficients;  the  notation  M'  will  stand  for  the  union  of  M  and  k  e 
TL^.  When  fie  M'\M,we  let  =  (3k. 

Definition  2.  The  curvelet  transform  is  the  mapping  that  associates  the  co- 
efficients  sequence  a^,  p  e  M'  to  an  arbitrary  square  integrable  function  f. 

We  will  call  curvelets  those  elements  defined  by 

Gy,  —  (12) 

with  an  obvious  modification  for  the  piece  corresponding  to  the  low  frequen¬ 
cies,  Gy  -  PoVfc.  2 

The  collection  of  curvelets  is  then  a  tight  frame  for  L2(1R  ) 

11/112= 

and,  of  course,  we  have  the  decomposition 

neM' 

with  equality  in  an  L2  sense. 

Let  /„  be  the  truncated  n-term  curvelet  series 

fn=  T,  “f‘l{KI>l“l(n))"<‘'  (^^^ 

The  following  theorem  is  proved  in  [6] . 

Theorem  3.  Let  g  e  W|{1R^)  and  f{x)  =  g{x)  with  7  being  two 

times  differentiable.  Let  fn  be  the  n-term  approximation  (15).  Then, 

11/ -/nib  <  Cn"*(logn)^/^ 

Again,  we  have  a  very  concrete  procedure  that  achieves  rates  of  approxi¬ 
mation  that  cannot  be  fundamentally  improved.  A  detailed  discussion  about 
the  optimality  of  this  result  is  in  [6]. 
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§6.  Conclusion 

In  this  paper,  we  presented  a  connected  set  of  ideas  originating  in  the  ridgelet 
transform  and  culminating  in  the  curvelet  transform.  We  have  shown  how 
these  representations  provide  efficient  representations  of  objects  that  are  sin¬ 
gular  along  curves.  These  tools,  however,  may  have  several  other  potential 
applications. 

Because  of  space  limitations,  we  set  aside  questions  related  to  the  prac¬ 
ticability  of  these  new  methods.  We  would  like  to  point  out  that  fast  al¬ 
gorithms  have  been  developed  to  implement  the  ridgelet,  monoscale  ridgelet 
and  curvelet  transform.  We  will  report  on  the  numerical  aspects  of  these 
transforms  in  a  separate  paper. 
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Curvelets:  A  Surprisingly  Effective  Nonadaptive 
Representation  for  Objects  with  Edges 


Emmanuel  J.  Candes  and  David  L.  Donoho 


Abstract.  It  is  widely  believed  that  to  efficiently  represent  an  otherwise 
smooth  object  with  discontinuities  along  edges,  one  must  use  an  adaptive 
representation  that  in  some  sense  ‘tracks’  the  shape  of  the  discontinuity 
set.  This  folk-belief  —  some  would  say  folk-theorem  —  is  incorrect.  At 
the  very  least,  the  possible  quantitative  advantage  of  such  adaptation  is 
vastly  smaller  than  commonly  believed.  We  have  recently  constructed  a 
tight  frame  of  curvelets  which  provides  stable,  efficient,  and  near-optimal 
representation  of  otherwise  smooth  objects  having  discontinuities  along 
smooth  curves.  By  applying  naive  thresholding  to  the  curvelet  transform 
of  such  an  object,  one  can  form  m-term  approximations  with  rate  of 
approximation  rivaling  the  rate  obtainable  by  complex  adaptive  schemes 
which  attempt  to  ‘track’  the  discontinuity  set.  In  this  article  we  explain 
the  basic  issues  of  efficient  m-term  approximation,  the  construction  of 
efficient  adaptive  representation,  the  construction  of  the  curvelet  frame, 
and  a  crude  analysis  of  the  performance  of  curvelet  schemes. 


§1.  Introduction 

In  many  important  imaging  applications,  images  exhibit  edges  -  discontinu¬ 
ities  across  curves.  In  traditional  photographic  imaging,  for  example,  this 
occurs  whenever  one  object  occludes  another,  causing  the  luminance  to  un¬ 
dergo  step  discontinuities  at  boundaries.  In  biological  imagery,  this  occurs 
whenever  two  different  organs  or  tissue  structures  meet. 

In  image  synthesis  applications,  such  as  CAD,  there  is  no  problem  in  deal¬ 
ing  with  such  discontinuities,  because  one  knows  where  they  are  and  builds  the 
discontinuities  into  the  representation  by  specially  adapting  the  representation 
—  for  example,  inserting  free  knots,  or  adaptive  refinement  rules. 

In  image  analysis  applications,  the  situation  is  different.  When  working 
with  real  rather  than  synthetic  data,  one  of  course  doesn’t  ‘know’  where  these 
edges  are;  one  only  has  a  digitized  pixel  array,  with  potential  imperfections 
caused  by  noise,  by  blurring,  and  of  course  by  the  unnatural  pixelization 
of  the  underlying  continuous  scene.  Hence  the  typical  image  analyst  only 
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has  recourse  to  representations  which  don’t  ‘know’  about  the  existence  and 
geometry  of  the  discontinuities  in  the  image. 

The  success  of  discontinuity-adapting  methods  in  CAD  and  related  image 
synthesis  fields  creates  a  temptation  for  an  image  analyst  -  a  temptation  to 
spend  a  great  deal  of  time  and  effort  importing  such  ideas  into  image  analysis. 
Almost  everyone  we  know  has  yielded  to  this  temptation  in  some  form,  which 
creates  a  possibility  for  surprise. 

Oracles  and  ideally-adapted  representation 

One  could  imagine  an  ideally-privileged  image  analyst  who  has  recourse  to 
an  oracle  able  to  reveal  the  positions  of  all  the  discontinuities  underlying  the 
image  formation.  It  seems  natural  that  this  ideally-privileged  analyst  could 
do  far  better  than  the  normally-endowed  analyst  who  knows  nothing  about 
the  position  of  the  discontinuities  in  the  image. 

To  elaborate  this  distinction,  we  introduce  terminology  borrowed  from 
fluid  dynamics,  where  ‘edges’  arise  in  the  form  of  fronts  or  shock  fronts. 

A  Lagrangian  representation  is  constructed  using  full  knowledge  of  the  intrinsic 
structure  of  the  object  and  adapting  perfectly  to  that  structure. 

•  In  fluid  dynamics  this  means  that  the  fluid  flow  pattern  is  known,  and 
one  constructs  a  coordinate  system  which  ‘flows  along  with  the  particles’, 
with  coordinates  mimicking  the  shape  of  the  flow  streamlines. 

•  In  image  representation  this  could  mean  that  the  edge  curves  are  known, 
and  one  constructs  an  image  representation  adapted  to  the  structure  of 
the  edge  curves.  For  example,  one  might  construct  a  basis  with  disconti¬ 
nuities  exactly  where  the  underlying  object  has  discontinuities. 

An  Eulerian  representation  is  fixed,  constructed  once  and  for  all.  It  is  non- 
adaptive  -  having  nothing  to  do  with  the  known  or  hypothesized  details  of 
the  underlying  object. 

•  In  fluid  dynamics,  this  would  mean  a  usual  euclidean  coordinate  system, 
one  that  does  not  depend  in  any  way  on  the  fluid  motion. 

•  In  image  representation,  this  could  mean  that  the  representation  is  some 
fixed  coordinate  representation,  such  as  wavelets  or  sinusoids,  which  does 
not  change  depending  on  the  positions  of  edges  in  the  image. 

It  is  quite  natural  to  suppose  that  the  Lagrangian  perspective,  when  it  is 
available,  is  much  more  powerful  that  the  Eulerian  one.  Having  the  privilege  of 
‘inside  information’  about  the  position  of  important  geometric  characteristics 
of  the  solution  seems  a  priori  rather  valuable.  In  fact,  this  position  has 
rather  a  large  following.  Much  recent  work  in  computational  harmonic  analysis 
(CHA)  attempts  to  find  bases  which  are  optimally  adapted  to  the  specific 
object  in  question  [7,10,11];  in  this  sense  much  of  the  ongoing  work  in  CHA 
is  based  on  the  presumption  that  the  Lagrangian  viewpoint  is  best. 

In  the  setting  of  edges  in  images,  there  has,  in  fact,  been  considerable 
interest  in  the  problem  of  developing  representations  which  are  adapted  to 
the  structure  of  discontinuities  in  the  object  being  studied.  The  (equivalent) 
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concepts  of  probing  and  minimum  entropy  segmentation  are  old  examples  of  this: 
wavelet  systems  which  are  specifically  constructed  to  allow  discontinuities  in 
the  basis  elements  at  specific  locations  [8,9].  More  recently,  we  are  aware 
of  much  informal  unpublished  or  preliminary  work  attempting  to  build  2D 
edge-adapted  schemes;  we  give  two  examples. 

•  Adaptive  triangulation  aims  to  represent  a  smooth  function  by  partition¬ 
ing  the  plane  into  a  sequence  of  triangular  meshes,  refining  the  meshes 
at  one  stage  to  create  finer  meshes  at  the  next  stage.  One  represents  the 
underlying  object  using  piecewise  linear  functions  supported  on  individ¬ 
ual  triangles.  It  is  easy  to  see  how,  in  an  image  synthesis  setting,  one 
can  in  principle  develop  a  triangulation  where  the  triangles  are  arranged 
to  track  a  discontinuity  very  faithfully,  with  the  bulk  of  refinement  steps 
allocated  to  refinements  near  the  discontinuity,  and  one  obtains  very  ef¬ 
fective  representation  of  the  object.  It  is  not  easy  to  see  how  to  do  this 
in  an  image  analysis  setting,  but  one  can  easily  be  persuaded  that  the 
development  of  adaptive  triangulation  schemes  for  noisy,  blurred  data  is 
an  important  and  interesting  project. 

•  In  an  adaptively  warped  wavelet  representation,  one  deforms  the  under¬ 
lying  image  so  that  the  object  being  analyzed  has  all  its  discontinuities 
aligned  purely  horizontal  or  vertical.  Then  one  analyzes  the  warped  ob¬ 
ject  in  a  basis  of  tensor-product  wavelets  where  elements  take  the  form 

This  is  very  effective  for  objects  which  are  smooth 
apart  from  purely  horizontal  and  purely  vertical  discontinuities.  Hence, 
the  warping  deforms  the  singularities  to  render  the  the  tensor  product 
scheme  very  effective.  It  is  again  not  easy  to  see  how  adaptive  warping 
could  work  in  an  image  analysis  setting,  but  one  is  easily  persuaded  that 
development  of  adaptively  warped  representations  for  noisy,  blurred  data 
is  an  important  and  interesting  project. 

Activity  to  build  such  adaptive  representations  is  based  on  an  article  of  faith: 
namely,  that  Eulerian  approaches  are  inferior,  that  oracle-driven  Lagrangian 
approaches  are  ideal,  and  that  one  should,  in  an  image  analysis  setting,  mimic 
Lagrangian  approaches,  attempting  empirically  to  estimate  from  noisy,  blurred 
data  the  information  that  an  oracle  would  supply,  and  build  an  adaptive  rep¬ 
resentation  based  on  that  information. 

Quantifying  rates  of  approximation 

In  order  to  get  away  from  articles  of  faith,  we  now  quantify  performance,  using 
an  asymptotic  viewpoint. 

Suppose  we  have  an  object  supported  in  [0, 1]^  which  has  a  discontinuity 
across  a  nice  curve  T,  and  which  is  otherwise  smooth.  Then  using  a  standard 
Fourier  representation,  and  approximating  with  built  from  the  best  m 
nonzero  Fourier  terms,  we  have 
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This  rather  slow  rate  of  approximation  is  improved  upon  by  wavelets.  The 
approximant  built  from  the  best  m  nonzero  wavelet  terms  satisfies 

ll/“/m  m-^OO.  (2) 

This  is  better  than  the  rate  of  Fourier  approximation,  and,  until  now,  is  the 
best  published  rate  for  a  fixed  non-adaptive  method  (i.e.  best  published  result 
for  an  ‘Eulerian  viewpoint’). 

On  the  other  hand,  we  will  discuss  below  a  method  which  is  adapted  to 
the  object  at  hand,  and  which  achieves  a  much  better  approximation  rate 
than  previously  known  ‘nonadaptive’  or  ‘Eulerian’  approaches.  This  adaptive 
method  selects  terms  from  an  overcomplete  dictionary  and  is  able  to  achieve 

Il/-/mll2 m^oo.  (3) 

Roughly  speaking,  the  terms  in  this  dictionary  amount  to  triangular  wedges, 
ideally  fitted  to  approximate  the  shape  of  the  discontinuity. 

Owing  to  the  apparent  trend  indicated  by  (l)-(3)  and  the  prevalence  of 
the  puritanical  belief  that  ‘you  can’t  get  something  for  nothing’,  one  might 
suppose  that  inevitably  would  follow  the 

Folk-Conjecture/ [Folk-Theorem].  The  result  (3)  for  adaptive  representa¬ 
tions  far  exceeds  the  rate  of  m-term  approximation  achievable  by  fixed  non¬ 
adaptive  representations. 

This  conjecture  appeals  to  a  number  of  widespread  beliefs: 

•  the  belief  that  adaptation  is  very  powerful, 

•  the  belief  that  the  way  to  represent  discontinuities  in  image  analysis  is  to 
mimic  the  approach  in  image  synthesis, 

•  the  belief  that  wavelets  give  the  best  fixed  nonadaptive  representation. 
In  private  discussions  with  many  respected  researchers  we  have  many 

times  heard  expressed  views  equivalent  to  the  purported  Folk-Theorem. 

The  surprise 

It  turns  out  that  performance  almost  equivalent  to  (3)  can  be  achieved  by  a 
nonadaptive  scheme.  In  other  words,  the  Folk-Theorem  is  effectively  false. 

There  is  a  tight  frame,  fixed  once  and  for  all  nonadaptively,  which  we  call 
a  frame  of  curvelets,  which  competes  surprisingly  well  with  the  ideal  adaptive 
rate  (3).  A  very  simple  m-term  approximation  -  summing  the  m  biggest  terms 
in  the  curvelet  frame  expansion  -  can  achieve 

\\f  -  ’Tn~'^{\ogmf,  m-^oo,  (4) 

which  is  nearly  as  good  as  (3)  as  regards  asymptotic  order.  In  short,  in  a 
problem  of  considerable  applied  relevance,  where  one  would  have  thought  that 
adaptive  representation  was  essentially  more  powerful  than  fixed  nonadaptive 
representation,  it  turns  out  that  a  new  fixed  nonadaptive  representation  is  es¬ 
sentially  as  good  as  adaptive  representation,  from  the  point  of  view  of  asymp¬ 
totic  m-term  approximation  errors.  As  one  might  expect,  the  new  nonadaptive 
representation  has  several  very  subtle  and  distinctive  features. 
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In  this  article,  we  would  like  to  give  the  reader  an  idea  of  why  (3)  represents 
the  ideal  behavior  of  an  adaptive  representation,  of  how  the  curvelet  frame  is 
constructed,  and  of  the  key  elements  responsible  for  (4).  We  will  also  attempt 
to  indicate  why  curvelets  perform  for  singularities  along  curves  the  task  that 
wavelets  perform  for  singularities  at  points. 

§2.  A  Precedent;  Wavelets  and  Point  Singularities 

We  mention  an  important  precedent  -  a  case  where  a  nonadaptive  scheme  is 
roughly  competitive  with  an  ideal  adaptive  scheme.  Suppose  we  have  a  piece- 
wise  polynomial  function  /  on  the  interval  [0, 1],  with  jump  discontinuities  at 
several  points. 

An  obvious  adaptive  representation  is  to  fit  a  piecewise  polynomial  with 
breakpoints  at  the  discontinuities.  If  there  are  P  pieces  and  each  polynomial 
is  of  degree  <  £),  then  we  need  only  keep  P  •  (D  +  1)  coefficients  and  P  —  1 
breakpoints  to  exactly  represent  this  function.  Common  sense  tells  us  that 
this  is  the  natural,  and  even,  the  ideal  representation  for  such  a  function. 

To  build  this  representation,  we  need  to  know  locations  of  the  discontinu¬ 
ities.  If  the  measurements  are  noisy  or  blurred,  and  if  we  don’t  have  recourse 
to  an  oracle,  then  we  can’t  necessarily  build  this  representation. 

A  less  obvious  but  much  more  robust  representation  is  to  take  a  nice 
wavelet  transform  of  the  object,  and  keep  the  few  resulting  nonzero  wavelet 
coefficients.  If  we  have  an  A-point  digital  signal  f{i/N),  1  <  i  <  N,  and 
we  use  Daubechies  wavelets  of  compact  support,  then  there  are  no  more  than 
C  •  log2(iV) '  P  '  {D  +  1)  nonzero  wavelet  coefficients  for  the  digital  signal. 

In  short,  the  nonadaptive  representation  needs  only  to  keep  a  factor 
C'log2(A')  more  data  to  give  an  equally  faithful  representation. 

We  claim  that  this  phenomenon  is  at  least  partially  responsible  for  the 
widespread  success  of  wavelet  methods  in  data  compression  settings.  One  can 
build  a  single  fast  transform  and  deal  with  a  wide  range  of  different  /,  with 
different  discontinuity  sets,  without  recourse  to  an  oracle. 

In  particular,  since  one  almost  never  has  access  to  an  oracle,  the  nat¬ 
ural  first  impulse  of  one  committed  to  the  adaptive  viewpoint  would  be  to 
‘estimate’  the  break  points  -  i.e.  to  perform  some  sort  of  edge  detection.  Un¬ 
fortunately  this  is  problematic  when  one  is  dealing  with  noisy  blurred  data. 
Edge  detection  is  a  whole  topic  in  itself  which  has  thousands  of  proposed  so¬ 
lutions  and  (evidently,  as  one  can  see  from  the  continuing  rate  of  publication 
in  this  area)  no  convincing  solution. 

In  using  wavelets,  one  does  not  need  edge  detectors  or  any  other  prob¬ 
lematic  schemes,  one  simply  extracts  the  big  coefficients  from  the  transform 
domain,  and  records  their  values  and  positions  in  an  organized  fashion. 

We  can  lend  a  useful  perspective  to  this  phenomenon  by  noticing  that  the 
discontinuities  in  the  underlying  /  are  point  singularities,  and  we  are  saying 
that  wavelets  need  in  some  sense  at  most  log(n)  coefficients  to  represent  a 
point  singularity  out  to  scale  1/n. 
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It  turns  out  that  even  in  higher  dimensions,  wavelets  have  a  near-ideal 
ability  to  represent  objects  with  point  singularities. 

The  two-dimensional  object  f(3{xi,X2)  =  l/((3^i  —  1/2)^  +  (^2  -  1/2)^)^ 
has,  for  /?  <  1/2,  a  square- integrable  singularity  at  the  point  (1/2, 1/2)  and  is 
otherwise  smooth.  At  each  level  of  the  2D  wavelet  pyramid,  there  are  effec¬ 
tively  only  a  few  wavelets  which  ‘feel’  the  point  singularity,  other  coefficients 
being  effectively  negligible.  In  approximation  out  to  scale  1/n,  only  about 
0(log(n))  coefficients  are  required. 

Another  approach  to  understanding  the  representation  of  singularities, 
which  is  not  limited  by  scale,  is  to  consider  rates  of  decay  of  the  countable 
coefficient  sequence.  Analysis  of  wavelet  coefficients  of  shows  that  for  any 
desired  rate  p,  the  N-th  largest  coefficient  can  be  bounded  by  CpN~^  for  all 
N.  In  short,  the  wavelet  coefficients  of  such  an  object  are  very  sparse. 

Thus  we  have  a  slogan:  wavelets  perform  very  well  for  objects  with  point 
singularities  in  dimensions  1  and  2. 

§3.  Failure  of  Wavelets  on  Edges 

We  now  briefly  sketch  why  wavelets,  which  worked  surprisingly  well  in  repre¬ 
senting  point  discontinuities  in  dimension  1,  are  less  successful  dealing  with 
‘edge’  discontinuities  in  dimension  2. 

Suppose  we  have  an  object  /  on  the  square  [0, 1]^  and  that  /  is  smooth 
away  from  a  discontinuity  along  a  curve  T.  Let’s  look  at  the  number  of 
substantial  wavelet  coefficients. 

A  grid  of  squares  of  side  2~^  by  2~^  has  order  2^  squares  intersecting  T. 
At  level  j  of  the  two-dimensional  wavelet  pyramid,  each  wavelet  is  localized 
near  a  corresponding  square  of  side  2~^  by  2"^ .  There  are  therefore  0(2^  ) 
wavelets  which  ‘feel’  the  discontinuity  along  F.  Such  a  wavelet  coefficient  is 
controlled  by 

and  in  effect  no  better  control  is  available,  since  the  object  /  is  not  smooth 
within  the  support  of  V’j.Jfci.fca  [14]-  Therefore  there  are  about  2^  coefficients  of 
size  about  2~K  In  short,  the  iV-th  largest  wavelet  coefficient  is  of  size  about 
l/N.  The  result  (2)  follows. 

We  can  summarize  this  by  saying  that  in  dimension  2,  discontinuities 
across  edges  are  spatially  distributed;  because  of  this  they  can  interact  rather 
extensively  with  many  terms  in  the  wavelet  expansion,  and  so  the  wavelet 
representation  is  not  sparse. 

In  short,  wavelets  do  well  for  point  singularities,  and  not  for  singularities 
along  curves.  The  success  of  wavelets  in  dimension  1  derived  from  the  fact 
that  all  singularities  in  dimension  1  are  point  singularities,  so  wavelets  have 
a  certain  universality  there.  In  higher  dimensions  there  are  more  types  of 
singularities,  and  wavelets  lose  their  universality. 

For  balance,  we  need  to  say  that  wavelets  do  outperform  classical  meth¬ 
ods.  If  we  used  sinusoids  to  represent  an  object  of  the  above  type,  then  we 
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have  the  result  (1),  which  is  far  worse  than  that  provided  by  wavelets.  For 
completeness,  we  sketch  the  argument.  Suppose  we  use  for  ‘sinusoids’  the 
complex  exponentials  on  [~7r,  tt]^,  and  that  the  object  /  tends  smoothly  to 
zero  at  the  boundary  of  the  square  [0,1]^,  so  that  we  may  naturally  extend 
it  to  a  function  living  on  [— tt,  tt]^.  Now  typically  the  Fourier  coefficients  of 
an  otherwise  smooth  object  with  a  discontinuity  along  a  curve  decay  with 
wavenumber  as  (the  very  well-known  example  is  /  =  indicator  of  a 

disk,  which  has  a  Fourier  transform  described  by  Bessel  functions).  Thus 
there  are  about  B?  coefficients  of  size  >  c  •  meaning  that  the  N-th 

largest  is  of  size  >  c  •  from  which  (1)  follows. 

In  short:  neither  wavelets  nor  sinusoids  really  sparsify  two-dimensional 
objects  with  edges  (although  wavelets  are  better  than  sinusoids). 

§4.  Ideal  Representation  of  Objects  with  Edges 

We  now  consider  the  optimality  result  (3),  which  is  really  two  assertions.  On 
the  one  hand,  no  reasonable  scheme  can  do  better  than  this  rate.  On  the 
other  hand,  there  is  a  certain  adaptive  scheme,  with  intimate  connections  to 
adaptive  triangulation,  which  achieves  it.  For  more  extensive  discussion  see 

[10,11,13]. 

In  talking  about  adaptive  representations,  we  need  to  define  terms  care¬ 
fully,  for  the  following  reason.  For  any  /,  there  is  always  an  adaptive  repre¬ 
sentation  of  /  that  does  very  well:  namely  the  orthobasis  ^  =  {V'o?  V'l)  •  •  •} 
with  first  element  V'o  =  //II/II2!  This  is,  in  a  certain  conception,  an  ‘ideal 
representation’  where  each  object  requires  only  one  nonzero  coefficient.  In  a 
certain  sense  it  is  a  useless  one,  since  all  information  about  /  has  been  hidden 
in  the  definition  of  representation,  so  actually  we  haven’t  learned  anything. 
Most  of  our  work  in  this  section  is  in  setting  up  a  notion  of  adaptation  that 
will  free  us  from  fear  of  being  trapped  at  this  level  of  triviality. 

Dictionaries  of  atoms 

Suppose  we  are  interested  in  approximating  a  function  in  L^(T),  and  we  have  a 
countable  collection  V  =  {0}  of  atoms  in  L^(T);  this  could  be  a  basis,  a  frame, 
a  finite  concatenation  of  bases  or  frames,  or  something  even  less  structured. 

We  consider  the  problem  of  m-term  approximation  from  this  dictionary, 
where  we  are  allowed  to  select  m  terms  0i , . . . ,  from  V  and  we  approximate 
/  from  the  L^-closest  member  of  the  subspace  they  span: 


fm  =  Proi{/|span(0i, . . . ,  (/.^)}. 


We  are  interested  in  the  behavior  of  the  m-term  approximation  error 


em(f;'D)  =  \\f-U\l 


where  in  this  provisional  definition,  we  assume  fm  is  a  best  approximation  of 
this  form  after  optimizing  over  the  selection  of  m  terms  from  the  dictionary. 


112 


E.  J.  Candh  and  D.  L.  Donoho 


However,  to  avoid  a  trivial  result,  we  impose  regularity  on  the  selection 
process.  Indeed,  we  allow  rather  arbitrary  dictionaries,  including  ones  which 
enumerate  a  dense  subset  of  L^(T),  so  that  in  some  sense  the  trivial  result 
01  =  //II/II2  =  0,  Vm  is  always  a  lurking  possibility.  To  avoid  this 
possibility  we  forbid  arbitrary  selection  rules.  Following  [10]  we  propose 

Definition.  A  sequence  of  selection  rules  (amC*))  choosing  m  terms  from  a 
dictionary  V, 

^m(/)  ~  0m)  5 

is  said  to  implement  polynomial  depth  search  if  there  is  a  single  fixed  enumer¬ 
ation  of  the  dictionary  elements  and  a  fixed  polynomial  7r(t)  such  that  terms 
in  (Jm{f)  come  from  the  first  7r(m)  elements  in  the  dictionary. 

Under  this  definition,  the  trivial  representation  based  on  a  countable 
dense  dictionary  is  not  generally  available,  since  in  any  fixed  enumeration, 
a  decent  1-term  approximation  to  typical  /  will  typically  be  so  deep  in  the 
enumeration  as  to  be  unavailable  for  polynomial-depth  selection.  (Of  course, 
one  can  make  this  statement  quantitative,  using  information-theoretic  ideas). 

More  fundamentally,  our  definition  not  only  forbids  trivialities,  but  it 
allows  us  to  speak  of  optimal  dictionaries  and  get  meaningful  results.  Starting 
now,  we  think  of  dictionaries  as  ordered,  having  a  first  element,  second  element, 
etc.,  so  that  different  enumerations  of  the  same  collection  of  functions  are 
different  dictionaries.  We  define  the  m-optimal  approximation  number  for 
dictionary  V  and  limit  polynomial  tt  as 

where  fm  is  constructed  by  optimizing  the  choice  of  m  atoms  among  the  first 
7r(m)  in  the  fixed  enumeration.  Note  that  we  use  squared  error  for  comparison 
with  (l)-(3)  in  the  Introduction. 


Approximating  classes  of  functions 

Suppose  we  now  have  a  class  T  of  functions  whose  members  we  wish  to  ap¬ 
proximate.  Suppose  we  are  given  a  countable  dictionary  V  and  polynomial 
depth  search  delimited  by  polynomial  7r(-). 

Define  the  error  of  approximation  by  this  dictionary  over  this  class  by 

em{J^]T>,7r)  =  maxe^(/;X),7r). 

We  may  find,  in  certain  examples,  that  we  can  establish  bounds 
em{^y'D,Tv)  =  m  — ^  00, 


for  all  p  <  p*-  At  the  same  time,  we  may  have  available  an  argument  showing 
that  for  every  dictionary  and  every  polynomial  depth  search  rule  delimited  by 

^  cm  ^  ,  m  >  mo(7r). 


Then  it  seems  natural  to  say  that  p*  is  the  optimal  rate  of  m-term  approxi¬ 
mation  from  any  dictionary  when  polynomial  depth  search  delimited  by  7r(-). 
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Starshaped  objects  with  boundaries 

We  define  Star-Set^ (C),  a  class  of  star-shaped  sets  with  C^-smooth  bound¬ 
aries,  by  imposing  regularity  on  the  boundaries  using  a  kind  of  polar  coor¬ 
dinate  system.  Let  p{0)  :  [0,27r)  — >•  [0,1]  be  a  radius  function  and  bo  = 
(2^1,0)  3^2, o)  be  an  origin  with  respect  to  which  the  set  of  interest  is  star-shaped. 
With  6i{x)  =  Xi  -  Xi^o,  i  =  1,2,  define  functions  6{xi,X2)  and  r(a:i,a:2)  by 

6  =  arctan(-(52/^i);  r  =  ((<5i)^  + 

For  a  starshaped  set,  we  have  {x\^X2)  G  -B  iff  0  <  r  <  p{0).  Define  the  class 
Star-Set^ (C)  of  sets  by 

{B:BC  [i,  A]^  i  <  p(e)  <  i  0e  10,2,r),  p  €  |/i(e)|  <  C}, 

and  consider  the  corresponding  functional  class 

Star2(C)  =  {/  -  1b:  B  G  Star-Set='(C)}  . 

The  following  lower  rate  bound  should  be  compared  with  (3). 

Lemma.  Let  the  polynomial  7r(-)  be  given.  There  is  a  constant  c  so  that,  for 
every  dictionary  V, 

e„(Star^(C);2>,,r)  >  m  -  oo. 

This  is  proved  in  [10]  by  the  technique  of  hypercube  embedding.  Inside 
the  class  Star^(C')  one  can  embed  very  high-dimensional  hypercubes,  and  the 
ability  of  a  dictionary  to  represent  all  members  of  a  hypercube  of  dimension  n 
by  selecting  m  <^n  terms  from  a  subdictionary  of  size  7r(m)  is  highly  limited 
if  7r(m)  grows  only  polynomially. 

For  each  /,  a  corresponding  orthobasis  is  adaptively  constructed  in  [13] 
which  achieves  the  rate  (3).  It  tracks  the  boundary  of  B  at  increasing  accuracy 
using  a  sequence  of  polygons;  in  fact  these  are  n-gons  connecting  equispaced 
points  along  the  boundary  of  B,  for  n  =  2^ .  The  difference  between  n-gons 
for  n  =  2^  and  n  =  2^'^^  is  a  collection  of  thin  triangular  regions  obeying 
width  7i:i  length"^',  taking  the  indicators  of  each  region  as  a  term  in  a  basis, 
one  gets  an  orthonormal  basis  whose  terms  at  fine  scales  are  thin  triangular 
pieces.  Estimating  the  coefficient  sizes  by  simple  geometric  analysis  leads  to 
the  result  (3).  In  fact,  [13]  shows  how  to  do  this  under  the  constraint  of 
polynomial-depth  selection,  with  polynomial  Cm^. 

Although  space  constraints  prohibit  a  full  explanation,  our  polynomial- 
depth  search  formalism  also  makes  perfect  sense  in  discussing  the  warped 
wavelet  representations  of  the  Introduction.  Consider  the  noncountable  ‘dic¬ 
tionary’  of  all  wavelets  in  a  given  basis,  with  all  continuum  warpings  applied. 
Notice  that  for  wavelets  at  a  given  fixed  scale,  warpings  can  be  quantized  with 
a  certain  finite  accuracy.  Carefully  specifying  the  quantization  of  the  warping, 
one  obtains  a  countable  collection  of  warped  wavelets,  for  which  polynomial 
depth  search  constraints  make  sense,  and  which  is  as  effective  as  adaptive 
triangulation,  but  not  more  so.  Hence  (3)  applies  to  (properly  interpreted) 
deformation  methods  as  well. 
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§5.  Curvelet  Construction 

We  now  briefly  describe  the  curvelet  construction.  It  is  based  on  combining 
several  ideas,  which  we  briefly  review: 

•  Ridgelets,  a  method  of  analysis  suitable  for  objects  with  discontinuities 
across  straight  lines. 

•  Multiscale  Ridgelets,  a  pyramid  of  windowed  ridgelets,  renormalized  and 
transported  to  a  wide  range  of  scales  and  locations. 

•  Bandpass  Filtering,  a  method  of  separating  an  object  into  a  series  of  dis¬ 
joint  scales. 

We  briefly  describe  each  idea  in  turn,  and  then  their  combination. 

Ridgelets 

The  theory  of  ridgelets  was  developed  in  the  Ph.D.  Thesis  of  Emmanuel 
Candes  (1998).  In  that  work,  Candes  showed  that  one  could  develop  a  system 
of  analysis  based  on  ridge  functions 

cos(0)  -f  a:2sin(^)  -  6)/a).  (5) 

He  introduced  a  continuous  ridgelet  transform  R/(a,  6,  B)  —  {'ipa,b,9{x),  f)  with 
a  reproducing  formula  and  a  Parseval  relation.  He  also  constructed  frames, 
giving  stable  series  expansions  in  terms  of  a  special  discrete  collection  of  ridge 
functions.  The  approach  was  general,  and  gave  ridgelet  frames  for  functions 
in  T^[0, 1]^  in  all  dimensions  d>  2  -  For  further  developments,  see  [3,5]. 

Donoho  [12]  showed  that  in  two  dimensions,  by  heeding  the  sampling  pat¬ 
tern  underlying  the  ridgelet  frame,  one  could  develop  an  orthonormal  set  for 
L^(]R^)  having  the  same  applications  as  the  original  ridgelets.  The  orthonor¬ 
mal  ridgelets  are  convenient  to  use  for  the  curvelet  construction,  although  it 
seems  clear  that  the  original  ridgelet  frames  could  also  be  used.  The  ortho- 
ridgelets  are  indexed  using  A  =  {j,  /c,  f,  e),  where  j  indexes  the  ridge  scale,  k 
the  ridge  location,  i  the  angular  scale,  and  E  the  angular  location;  e  is  a  gender 
token.  Roughly  speaking,  the  ortho-ridgelets  look  like  pieces  of  ridgelets  (5) 
which  are  windowed  to  lie  in  discs  of  radius  about  2*;  Bi/  =  i/2^  is  roughly 
the  orientation  parameter,  and  2~^  is  roughly  the  thickness. 

A  formula  for  ortho-ridgelets  can  be  given  in  the  frequency  domain 

Mi)  =  l4r^{iAj,fc(KIX((6')  +  i>jA-\i\)^li{^  +  ’r))/2 . 

Here  the  are  Meyer  wavelets  for  IR,  ^  are  periodic  wavelets  for  [— tt,  tt), 
and  indices  run  as  follows:  j,  A;  G  2Z,  ^  =  0, . . . ,  2^“^  —  1;  i  >  1,  and,  if  e  =  0, 
i  =  max(l,  j),  while  if  e  =  1,  z  >  max(l,  j).  We  let  A  be  the  set  of  such  A. 
The  formula  is  an  operationalization  of  the  ridgelet  sampling  principle: 

•  Divide  the  frequency  domain  in  dyadic  coronae  |^|  G  [2-^,  2-^'^^]. 

•  In  the  angular  direction,  sample  the  j-th  corona  at  least  2^  times. 

•  In  the  radial  frequency  direction,  sample  behavior  using  local  cosines. 


Curvelets 


115 


The  sampling  principle  can  be  motivated  by  the  behavior  of  Fourier  trans¬ 
forms  of  functions  with  singularities  along  lines.  Such  functions  have  Fourier 
transforms  which  decay  slowly  along  associated  lines  through  the  origin  in  the 
frequency  domain.  As  one  traverses  a  constant  radius  arc  in  Fourier  space, 
one  encounters  a  ‘Fourier  ridge’  when  crossing  the  line  of  slow  decay.  The 
ridgelet  sampling  scheme  tries  to  represent  such  Fourier  transforms  by  using 
wavelets  in  the  angular  direction,  so  that  the  ‘Fourier  ridge’  is  captured  neatly 
by  one  or  a  few  wavelets.  In  the  radial  direction,  the  Fourier  ridge  is  actu¬ 
ally  oscillatory,  and  this  is  captured  by  local  cosines.  A  precise  quantitative 
treatment  is  given  in  [4] . 

Multiscale  ridgelets 

Think  of  ortho-ridgelets  as  objects  which  have  a  “length”  of  about  1  and  a 
“width”  which  can  be  arbitrarily  fine.  The  multiscale  ridgelet  system  renor¬ 
malizes  and  transports  such  objects,  so  that  one  has  a  system  of  elements  at 
all  lengths  and  all  finer  widths. 

In  a  light  mood,  we  may  describe  the  system  impressionistically  as  “brush 
strokes”  with  a  variety  of  lengths,  thicknesses,  orientations  and  locations. 

The  construction  employs  a  nonnegative,  smooth  partition  of  energy 
function  w,  obeying  -  ^2)  =  Define  a  transport 

operator,  so  that  with  index  Q  indicating  a  dyadic  square  Q  =  (5,/ci,fc2) 
of  the  form  [ki/2\{ki  +  l)/2^)  x  [A:2/2^(fc2  +  l)/2^),  by  (TQf){xuX2)  = 
f{2^xi  -  ki,2^X2  —  k2)-  The  Muitiscale  Ridgelet  with  index  //  =  (Q,  X)  is  then 

=  2*  •Tq(w  -  Pa). 

In  short,  one  transports  the  normalized,  windowed  ortho-ridgelet. 

Letting  Qs  denote  the  dyadic  squares  of  side  2~®,  we  can  define  the 
subcollection  of  Monoscale  Ridgelets  at  scale  s: 

Ms  =  {{Q,X):QeQs,XeA}. 

Orthonormality  of  the  ridgelets  implies  that  each  system  of  monoscale  ridgelets 
makes  a  tight  frame,  in  particular  obeying  the  Parseval  relation 

tieMs 

It  follows  that  the  dictionary  of  multiscale  ridgelets  at  all  scales,  indexed  by 


M  = 

is  not  frameable,  as  we  have  energy  blow-up: 

E  sf  = 


(6) 
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The  Multiscale  Ridgelets  dictionary  is  simply  too  massive  to  form  a  good  ana¬ 
lyzing  set.  It  lacks  inter-scale  orthogonality  -  '^(Q.A)  typically  orthogonal 
to  if  Q  Q'  are  squares  at  different  scales  and  overlapping  loca¬ 

tions.  In  analyzing  a  function  using  this  dictionary,  the  repeated  interactions 
with  all  different  scales  causes  energy  blow-up  (6). 

Our  construction  of  curvelets  solves  (6)  by  disallowing  the  full  richness  of 
the  Multiscale  Ridgelets  dictionary.  Instead  of  allowing  ‘brushstrokes’  of  all 
different  ‘lengths’  and  ‘widths’,  we  allow  only  those  where  width  ^  length^. 

Subband  filtering 

Our  solution  to  the  ‘energy  blow-up’  (6)  is  to  decompose  /  into  subbands  using 
standard  filterbank  ideas.  Then  we  assign  one  specific  monoscale  dictionary 
Ms  analyze  one  specific  (and  specially  chosen)  subband. 

We  define  coronae  of  frequencies  |^|  G  [2^®,  2^®'''^],  and  subband  filters 
extracting  components  of  /  in  the  indicated  subbands;  a  filter  Pq  deals  with 
frequencies  |^|  <  1.  The  filters  decompose  the  energy  exactly  into  subbands: 


The  construction  of  such  operators  is  standard  [15];  the  coronization  oriented 
around  powers  2^®  is  nonstandard  -  and  essential  for  us.  Explicitly,  we  build  a 
sequence  of  filters  and  ^2s  =  2“^® ^(2^®-),  s  =  0, 1, 2, . . .  with  the  following 
properties:  $o  is  a  lowpass  filter  concentrated  near  frequencies  j^j  <  1;  is 
bandpass,  concentrated  near  |^|  G  [2^®,  2^®+^];  and  we  have 

S>0 


Hence,  is  simply  the  convolution  operator  A^/  =  *  /• 

Definition  of  curvelet  transform 

Assembling  the  above  ingredients,  we  are  able  to  sketch  the  definition  of  the 
Curvelet  transform.  We  let  M'  consist  of  M  merged  with  the  collection  of 
integral  pairs  (A:i,fc2)  indexing  unit-side  squares  in  the  plane. 

The  curvelet  transform  is  a  map  L^(IR^)  t— ^  yielding  Curvelet 

coefficients  G  M').  These  come  in  two  types.  At  coarse  scale  we  have 

wavelet  scaling  function  coefficients 

tt/z  =  =  {kuk2)  e  M\M, 

where  (f>ki,k2  is  the  Lemarie  scaling  function  of  the  Meyer  basis,  while  at  fine 
scale  we  have  Multiscale  Ridgelets  coefficients  of  the  bandpass  filtered  object: 


=  (As/,'0^),  fj,  e  Ms, s  =  1,2,..,. 
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Note  well  that  each  coefficient  associated  to  scale  2  ®  derives  from  the  subband 
filtered  version  of  /  -  Asf  -  and  not  from  f.  Several  properties  are  immediate: 

•  Tight  Frame: 

ll/lli  =  E  Kl'- 

•  Existence  of  Coefficient  Representers  (Frame  Elements): 


•  Reconstruction  Formula: 

/  =  E  (/’>>>■ 

ti^M' 


•  Formula  for  Frame  Elements: 

7^  =  (1  e  Qs. 

In  short,  the  curvelets  are  obtained  by  bandpass  filtering  of  Multiscale 
Ridgelets  with  passband  is  rigidly  linked  to  the  scale  of  spatial  localization 

•  Anisotropy  Scaling  Law:  Linking  the  filter  passband  |^|  ~  2^®  to  the 
spatial  scale  2“®  imposes  that  (1)  most  curvelets  are  negligible  in  norm 
(most  multiscale  ridgelets  do  not  survive  the  bandpass  filtering  Ag);  (2) 
the  non-negligible  curvelets  obey  length  2~®  while  width  ph  2~^®.  So 
the  system  obeys  approximately  the  scaling  relationship 

width  ^  length^. 

It  is  here  that  the  2^®  coronization  scheme  comes  into  play. 

§6.  Why  Should  This  Work? 

The  curvelet  decomposition  can  be  equivalently  stated  in  the  following  form: 

•  Subband  Decomposition.  The  object  /  is  filtered  into  subbands: 

/h^(Po/,Ai/,A2/,...). 

•  Smooth  Partitioning.  Each  subband  is  smoothly  windowed  into  “squares” 
of  an  appropriate  scale: 


Asf^(wQAsf)QeQs‘ 

•  Renormalization.  Each  resulting  square  is  renormalized  to  unit  scale 


gQ-2  ^(Tq)  ^{wqAJ),  QeQs‘ 
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•  Ridgelet  Analysis.  Each  square  is  analyzed  in  the  ortho-ridgelet  system 

=  {9Q,Px)^  fi  =  {Q,X). 

We  can  now  give  a  crude  explanation  of  why  the  main  result  (4)  holds. 
Effectively,  the  bandpass  images  As/  are  almost  vanishing  at  x  far  from  the 
edges  in  /.  Along  edges,  the  bandpass  images  exhibit  ridges  of  width  2“^® 
-  the  width  of  the  underlying  bandpass  filter. 

The  partitioned  bandpass  images  are  broken  into  squares  of  side  2""^  x2“^. 
The  squares  which  do  not  intersect  edges  have  effectively  no  energy,  and  we 
ignore  them.  The  squares  which  do  intersect  edges  result  from  taking  a  nearly- 
straight  ridge  and  windowing  it.  Thus  the  squares  which  ‘matter’  exhibit 
tiny  ridge  fragments  of  aspect  ratio  2“®  by  2“^®.  After  renormalization,  the 
resulting  gq  exhibits  a  ridge  fragment  of  about  unit  length  and  of  width  2“^. 
The  ridge  fragment  is  then  analyzed  in  the  ortho-ridgelet  system,  which  should 
(we  hope)  yield  only  a  few  significant  coefficients. 

In  fact,  simple  arguments  of  size  and  order  give  an  idea  how  the  curvelet 
coefficients  roughly  behave.  We  give  an  extremely  loose  description. 

First,  at  scale  2“®,  there  are  only  about  0(2®)  squares  Q  ^  Qs  that 
interact  with  the  edges.  Calculating  the  energy  in  such  a  square  using  the  size 
of  the  support  and  the  height  of  the  ridge  leads  to 

(length  •  width)^/^  •  height  (2”®  x  2“^®)^/^  x  1. 

Indeed,  the  height  of  the  ridge  is  bounded  by 

||A  JIU  =  11^2.  *  /Hoc  <  ||^2s||l||/||oo  =  ll^lllll/lloo. 

Since  we  are  interested  in  uniformly  bounded  functions  /,  the  height  is  thus 
bounded  by  a  constant  C.  The  calculation  of  the  norm  |l5fQ|l2  ~  2“^/^  follows 
immediately  (because  of  the  renormalization,  the  height  of  the  ridge  gq  is  now 
2"®)  .  Now  temporarily  suppose  that  for  some  fixed  K  not  depending  on  Q, 

each  ridge  fragment  gq  is  a  sum  of  at  most  K  ortho-ridgelets.  (7) 

This  would  imply  that  at  level  s  we  have  a  total  number  of  coefficients 

0(2*)  squares  which  ‘matter’  x  A'coefficients/square, 

while  the  norm  estimate  for  gq  and  the  orthonormality  of  ridgelets  give 

coefficient  amplitude  <  C  ■  2"^*/^. 

The  above  assumptions  imply  that  the  N-th  largest  curvelet  coefficient  is  of 
size  <  C  •  Letting  \ct\(N)  denote  the  N-th.  coefficient  amplitude,  the 

tail  sum  of  squares  would  obey 

1]  i“l(w)  <  C  ■  (8) 

N>m 
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This  coefficient  decay  leads  to  (4)  as  follows.  Let  fii,.. .  ,/Lim  enumerate 
indices  of  the  m  largest  curvelet  coefficients.  Build  the  m-term  approximation 

m 

fm  — 

2  =  1 

By  the  tight  frame  property, 

\\f-fZr<  E  HN),<C-m-\ 

N=m+1 

where  the  last  step  used  (8).  This  of  course  would  establish  (4)  -  in  fact 
something  even  stronger,  something  fully  as  good  as  (3). 

However,  we  have  temporarily  assumed  (7)  -  which  is  not  true.  Each 
ridge  fragment  generates  a  countable  number  of  nonzero  ridgelet  coefficients 
in  general.  The  paper  [6]  gets  (4)  using  much  more  subtle  estimates. 

§7.  Discussion 

Why  call  these  things  curvelets? 

The  visual  appearance  of  curvelets  does  not  match  the  name  we  have  given 
them.  The  curvelets  waveforms  look  like  brushstrokes;  brush  lets  would  have 
been  an  appropriate  name,  but  it  was  taken  already,  by  F.  Meyer  and  R. 
Coifman,  for  an  unrelated  scheme  (essentially,  Gabor  Analysis). 

Our  point  of  view  is  simply  that  curvelets  exemplify  a  certain  curve  scaling 
law,  width  =  length‘s,  which  is  naturally  associated  to  curves. 

A  deeper  connection  between  curves  and  curvelets  was  alluded  to  in  our 
talk  at  Curves  and  Surfaces  ’99.  Think  of  a  curve  in  the  plane  as  a  distribu¬ 
tion  supported  on  the  curve,  in  the  same  way  that  a  point  in  the  plane  can 
be  thought  of  as  a  Dirac  distribution  supported  on  that  point.  The  curvelets 
scheme  can  be  used  to  represent  that  distribution  as  a  superposition  of  func¬ 
tions  of  various  lengths  and  widths  obeying  the  scaling  law  width  =  length‘s. 
In  a  certain  sense  this  is  a  near-optimal  representation  of  the  distribution. 

The  analogy  and  the  surprise 

Sections  2  and  3  showed  that  wavelets  do  surprisingly  well  in  representing 
point  singularities.  Without  attempting  an  explicit  representation  of  ‘where 
the  bad  points  are’,  wavelets  do  essentially  as  well  as  ideal  adaptive  schemes 
in  representing  the  point  singularities. 

Sections  4-6  showed  that  the  non-adaptive  curvelets  representation  can 
do  nearly  as  well  in  representing  objects  with  discontinuities  along  curves  as 
adaptive  methods  that  explicitly  track  the  shape  of  the  discontinuity  and  use 
a  special  adaptive  representation  dependent  on  that  tracking. 

We  find  it  surprising  and  stimulating  that  the  curvelet  representation 
can  work  so  well  despite  the  fact  that  it  never  constructs  or  depends  on  the 
existence  of  any  ‘map’  of  the  discontinuity  set. 

We  also  find  it  interesting  that  there  is  a  system  of  analysis  which  plays 
the  role  for  curvilinear  singularities  that  wavelets  play  for  point  singularities. 
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Bases  in  Function  Spaces  on  Compact  Sets 

Zbigniew  Ciesielski 


Abstract.  This  is  a  brief  history,  covering  the  twentieth  century,  of 
spline  bases  on  cubes,  and  an  exposition  of  constructing  bases  in  classical 
function  spaces  over  compact  smooth  finite  dimensional  manifolds. 


§1.  Introduction 

The  aim  of  this  paper  is  to  present  an  overview  on  some  twentieth  century 
developments  in  the  theory  of  spline  bases.  We  start  by  recalling  some  of  the 
relevant  notions  on  bases  in  Banach  spaces  (for  more  details  see  e.g.  [1,30]). 
For  simplicity  we  are  going  to  stay  within  the  real  Banach  spaces.  An  abstract 
Banach  space  X  with  the  norm  ||  •  ||x  is  denoted  as  [X,  ||  •  ||x]-  The  sequence 
{xn,n  =  0, 1, . . .)  in  [X,  ||  •  ||x]  is  called  a  basis  in  X  if  to  each  x  e  X  there  is 
a  unique  sequence  of  scalars  a  =  (on,  n  =  0, 1, . . .)  such  that 

CXD 

X  ~  ^  "]  CLfl  Xyi,  (1) 

n=0 

There  are  unique  linear  functionals  (x*)  C  X*  such  that  On  = 
system  (xq,  a^i,  •  •  • ;  •  •  •)  is  biorthogonal  i.e.  =  6k, i-  The  basis 

(x„)  is  unconditional  if  for  each  rc  G  X  the  series  in  the  right  hand  side  of  (1) 
converges  unconditionally.  Now,  denote  by  A  the  set  of  all  a  appearing  in  (1) 
while  X  is  running  through  X.  The  linear  space  A  becomes  a  Banach  space 
linearly  isomorphic  to  X  with  the  norm 

n 

||a|U  =  supll  (2) 

The  Banach  space  [A,  ||  •  |U]  is  customarily  called  the  coefficient  space.  Intro¬ 
ducing  the  basis  constant,  which  by  the  Banach-Steinhaus  theorem  is  finite, 

n 

(3=  sup  supIlVuiXillx,  (3) 

INx<in>o  ^ 
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we  obtain  the  equivalence  of  norms 

IklU  <  Ma  <  /3\\x\\x^  (4) 

Thus,  every  Banach  space  with  a  basis  is  linearly  isomorphic  to  a  sequence 
space.  However,  depending  on  the  space  X  and  on  the  particular  basis,  the 
corresponding  sequence  space  may  be  of  little  use.  Sometimes  it  helps  to  con¬ 
sider  equivalent  basis  in  X.  Two  basis  (a:„)  C  X  and  (x'^)  C  X'  are  said 
to  be  equivalent  if  .4  =  In  case  of  equivalent  bases  we  conclude  that 
II^IU  ~  II^IU'  for  a  £  A.  Now,  we  may  describe  the  program  of  the  pa¬ 
per.  For  a  given  compact  C°°  finite  dimensional  manifold  M  and  for  given 
order  of  smoothness  m,  we  are  going  to  describe  the  construction  of  biorthog- 
onal  system  of  functions  of  class  over  M  such  that  the  system  itself  is  a 
basis  in  V’MO(M),  Hi[M)  and  and  in  the  whole  scale  of  Sobolev  (M), 
-m  <  k  <m,  and  of  Besov  spaces  ^(M),  -m  <  s  <  m,  with  1  <  p,q  <  oo. 
At  the  same  time  the  dual  system  is  going  to  be  a  basis  in  the  same  scale  of 
function  spaces  with  the  corresponding  spaces  VMO,Hi,W  and  B  replaced 

O  O  0  o 

by  VMO,  Hi,  W  and  B,  respectively.  The  constructed  system  of  functions  (or 
its  dual)  is  always  an  unconditional  basis  whenever  the  space  admits  an  un¬ 
conditional  basis.  Moreover,  for  the  constructed  basis,  we  are  able  to  describe 
the  coefficient  spaces  in  case  of  the  BMO  and  Besov  spaces.  The  duality 
questions  will  be  treated  at  the  same  time.  The  main  idea  of  the  general 
construction  was  announced  by  T.  Figiel  and  the  author  at  the  Gdansk  1979 
conference;  Approximation  and  function  spaces  (cf.  [14]),  and  then  carried 
out  in  the  subsequent  papers  [10,11,15,16,17]. 

The  material  is  arranged  as  follows:  Sections  2  presents  historical  remarks 
on  the  Haar,  Faber-Schauder,  Franklin  and  spline  systems;  Section  3  treats 
function  spaces  and  bases  with  boundary  conditions  on  the  cube;  Section  4 
describes  the  reduction  of  function  spaces  and  bases  from  manifolds  to  the 
cubes  with  boundary  conditions. 

It  is  encouraging,  that  in  recent  years,  the  ideas  of  the  constructions  from 
[16,17]  stimulated  works  on  modifications  of  the  decomposition  of  the  func¬ 
tion  spaces  on  smooth  compact  manifolds  into  standard  spaces,  and  also  on 
constructing  new  bases  in  the  standard  spaces.  The  new  investigations  of 
W.  Dahmen  and  R.  Schneider  as  they  were  presented  at  this  Saint  Malo  con¬ 
ference  (see  also  [19])  are  very  promising  as  they  show  that  these  constructions 
can  be  applied  to  treat  singular  operators  on  manifolds  both  theoretically  and 
numerically. 

§2.  The  History  of  Haar,  Faber-Schauder,  Franklin  and  Spline 

Systems 

At  the  very  origin  there  is  the  construction  of  A.  Haar  (1909)  [25]  of  a  simple 
ONC  (orthonormal  and  complete)  system  x  =  {Xn,^^  =  1,...)  on  /  =  [0,1]. 
The  system  x  has  the  nice  property  that  each  continuous  function  has  its 
Fourier-Haar  series  uniformly  convergent  on  I.  Here  and  later  on,  unless 
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otherwise  stated,  the  orthogonality  is  understood  with  respect  to  the  Lebesgue 
measure.  The  orthonormal  Haar  functions  over  I  can  be  defined  by  means  of 
a  single  function  h,  where 


1  for  t  <  0, 

—1  for  0  <  f  < 

^  0  otherwise. 

Define  for  j  >  0,  1  <  <  2^  and  n  =  2^  +  k 

1 

X1  =  1,  and  =  =  (^) 

The  Haar  functions  are  piecewise  constant  and  left  continuous  i.e.  they  are 
splines  of  order  r  =  1  (of  degree  r  —  1  =  0).  For  later  convenience  for  the 
support  of  Xn  we  introduce  the  symbol  (n)  =  (and  let 

denote  the  middle  point).  It  was  pointed  out  by  J.  Schauder  (1927)  [29] 
that  the  Haar  system  x  is  a  basis  in  the  Lebesgue  space  Lp{I),  1  <  p  <  oo, 
with  the  basis  constant  equal  to  1.  Much  more  involved  was  the  proof  of 
R.E.A.C.  Paley  (1932)  [27]  (see  also  J.  Marcinkiewicz  (1937)  [26])  that  the 
Haar  system  is  an  unconditional  basis  in  each  Lp(I),  1  <  p  <  oo.  For  a  real 
variable  proof  of  this  property,  we  refer  e.g.  to  Ch.  Watari  (1964)  [31].  The 
unconditional  basic  constant  for  the  Haar  system  in  Lp{I)  appears  to  be  equal 
to  max(p,p')  ~  1  where  l/p+  1/p'  ~  1  (see  e.e.  D.L.  Burkholder  [5]).  The 
extensively  investigated  martingale  theory  covers  many  results  on  the  Haar 
system,  but  it  is  not  very  related  to  our  subject,  and  will  not  be  discussed 
here  (see  e.g.  [23]). 

To  construct  the  Faber-Schauder,  Franklin  and  more  general  spline  sys¬ 
tems,  it  is  convenient  to  introduce  the  following  operations  on  sequences  of 
functions.  For  a  given  sequence  -ip  =  =  Ij  2, . . .)  of  integrable  functions 

on  J,  we  define 

G3^=  (l,G^n,«  =  1,2, ...)  and  =  (GV’n,?!  =  1, 2, . . .), 

where  G/(t)  =  Jq  f(s)  ds.  If  in  addition,  the  functions  in  ^  are  linearly 
independent,  then  the  result  of  the  Gram  orthogonalization  process  applied 
to  Tp  is  denoted  by  O'lp.  It  is  assumed  in  this  definition  that  so  obtained 
orthogonal  set  is  normalized  in  L^(/). 

The  Faber-Schauder  system  can  now  be  obtained  from  the  Haar  system 
by  the  operation  G 

0  =  (0n,n>O)  =  Gx.  (6) 

The  Faber-Schauder  functions  are  continuous  splines  of  order  2.  It  was  proved 
by  G.  Faber  (1910)  [21]  (see  also  J.  Schauder  (1927)  [29])  that  this  system  is 
a  basis  in  the  space  of  continuous  functions  [G(/),  ||  •  [[oo].  In  this  case  the 


h{t)  =  { 
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basis  constant  is  again  equal  to  1  and  the  basis  itself  is  interpolating  at  dyadic 
points  of  /. 

The  orthonormal  set  constructed  by  Ph.  Franklin  (1928)  [22]  can  now  be 
defined  as  the  result  of  application  of  the  operation  O  to  the  Schauder  system 

/  =  (/n,n  =  0,l,...)  =  Of  (7) 

These  functions  are  again  continuous  splines  of  order  r  =  2.  Ph.  Franklin 
proved  in  [22]  that  /  is  a  basis  in  [C(I),  ||  •  ||oo]-  For  an  elegant  proof  that  the 
Franklin  system  is  a  basis  in  C(I)  and  in  Lp{I),  1  <  p  <  oo,  we  refer  to  [6], 
Using  the  same  idea  as  in  [31],  S.  V.  Botchkarev  (1974,  1975)  [3,  4]  proved 
the  unconditionality  of  the  Franklin  system  in  each  Lp{I),  1  <  p  <  oo.  There 
is  an  extensive  literature  on  the  pointwise  behavior  of  the  Franklin  series,  but 
we  mention  only  the  expository  article  by  G.  G.  Gevorkyan  [24]. 

The  operation  G  increases  the  order  r  of  splines  and  the  order  of  their 
smoothness  by  1,  and  O  preserves  these  orders.  We  may  repeat  this  two  step 
process  starting  now  with  the  orthonormal  Franklin  system  and  then  repeat 
it  again  and  again.  In  general,  for  r  >  1  we  use  the  notation 

/('■)  =  (/<’■>,«>  1  -  r)  and  >  -r)  =  G/'’') 


and 

fir+l)  ^ 

Consequently,  we  have  for  the  order  r  >  1  the  following  inductive  formula  for 
the  spline  ONC  system  on  I 

=  O  o  with  =  X-  (8) 

In  particular  in  this  notation  ^  and  /  =  was  proved  in  [7] 

that  is  a  basis  in  [(7(7),  ||  •  jjool  and  by  J.  Radecki  (1970)  [28]  that 
is  a  basis  in  [(7(7),  ||  *  jjoo]  and  in  each  7/p (7),  1  <  p  <  oo.  The  proof  that 
for  arbitrary  r  >  1  is  a  basis  in  (7(7)  and  in  Lp(I)  follows  from  the  work  of  J. 
Domsta  (1972)  [20]  (see  also  [12]). 

From  the  construction  of  the  ONC  system  it  follows  that  its  first  r 
elements  . . . ,  are  simply  the  orthonormal  Legendre  polynomials  on 
7;  the  degree  of  is  i  +  r  -  2.  Now,  with  each  r  >  1  we  associate  a  family 
of  spline  systems 

=  (/^->fc),n>|A:H-l-r)  with  -  r  <  k  <  r,  (9) 

where 

for  0  <  <  r. 


J  n 


for  — r  <  A:  <  0 


(10) 
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and  Df(t)  =  ^f{t)  and  H/(i)  =  //  f{s)  ds.  Since  D  is  inverse  to  G  and  H  is 
adjoint  to  G  in  L\I)^  it  follows  that  for  lfc|  <  r, 

=  for  »,m>|fc|  +  l-r.  (11) 

Equally  important  are  spline  ON  systems  defined  by  formula  similar  to  (8) 
with  G  replaced  by  Go,  i.e. 

^(r+l)  _  Q  Q  Q^^(r)  ^  ^12) 

Here  again,  with  each  r  >  1  we  associate  a  family  of  spline  systems 

n  >  I)  with  —r<k<r,  (13) 

where 

forO<fc<r-. 

\  for  -r  <k  <  0; 

and  as  before  we  have  for  |fc|  <  r, 

=  for  n,m>l.  (15) 


In  what  follows  in  this  section  we  denote  by  either  or  .  Since 
the  family  {F^’’’^^ ,  -r  <  A;  <  r}  of  spline  systems  is  the  main  ingredient  in  the 
construction  presented  in  the  next  section,  it  is  natural  to  recall  now  its  basic 
properties.  Notice  that  the  elements  of  are  indexed  by  n  >  n{k,F), 

where  n{k,  /)  =  |fc|  +  2  -  r  and  n{k,  g)  =  1. 

For  given  r  and  k  such  that  \k\  <  r  and  for  given  n  >  n(fc,F),  we  have 
the  kernel  corresponding  to  the  partial  sum  operator  with  index  n 

for  s,tel.  (16) 

v=nik,F) 

The  following  exponential  estimates  (cf.  [8,13,17])  play  a  fundamental  role 
in  our  construction.  There  are  two  constants:  C  =  Cr  <  oo  and  q  =  Qr, 
0  <  q  <1,  such  that  for  |A:|  <  r  we  have 

llf<’'’*>(s,t)|  <C'-(n  +  r).g(’’+’'>l“-'l  for  s,tel,  (17) 

and 

where  has  been  defined  earlier  as  the  middle  point  of  (n).  Now,  the 
biorthogonality  (11),  (15)  and  (18)  imply  for  1  <  p  <  oo,  \k\  <  r,  and  for 
any  real  sequence  (a„,  2^  <  n  <  2'^'*‘^),  {j  >  0),  the  equivalence 

11  Y,  Y  (19) 

2J<n<2J+i  27<n<2J+i 

Moreover,  it  follows  that 

II  Y  llp~ll  E  Up.  (20) 

2i<n<2i+i  2J<n<2J+i 

where  the  positive  constants  in  the  equivalences  ~  in  (19)  and  (20)  depend 
on  r  only. 

Now,  as  one  of  the  consequences  of  (17)  and  (18),  we  obtain 
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Theorem  1.  For  given  r,  k,  |A:|  <  r  and  p,  1  <  p  <  oo,  the  system 
is  a  basis  in  Lp(I).  Moreover,  for  1  <  p  <  oo,  each  of  the  systems  is  an 
unconditional  basis  in  Lp(I),  and  all  of  them  are  equivalent  bases  in  this 
space.  Moreover,  is  a  basis  in  C(I)  for  each  k,  0  <  k  <  r  —  2. 


§3.  The  Standard  Spaces  over  Cubes 

We  start  with  general  setup  which  will  be  needed  in  the  following  sections.  Let 
the  dimension  d  be  fixed,  and  let  M  be  a  compact  C°°  d-dimensional  manifold 
(d-manifold).  For  simplicity,  we  assume  here  that  M  has  no  boundary.  We 
denote  by  p  one  of  the  measures  which  locally  is  of  the  form  dp  —  hdx  where 
h  is  positive  C°°  function.  A  closed  set  Q  C  M  is  said  to  be  a  d-cube  if 
it  is  diffeomorphic  to  the  standard  cube  [0, 1]*^.  A  compact  set  AT  C  M  or 
K  C  is  said  to  be  proper  if  it  can  be  viewed  locally  as  an  epigraph  of  a 
lipschitzian  function  of  d  —  1  variables  (cf.  Def.  3.1  in  [16]).  We  are  going 
to  discuss  function  spaces  F{K)  over  a  proper  subsets  K,  in  particular  the 
Sobolev  spaces  with  F  =  and  the  Besov  spaces  with  T  =  Bp  q.  In  the 
Sobolev  space  W^[K),  K  C  R^,  1  <  P  <  oo,  m  >  0,  we  shall  use  the  norm 

ii/ii^>(^^)=  E  ii^“/iip(^)-  (21) 

|Q:|<m 


Clearly,  W^(K)  =  Lp{K)  and  we  denote  by  W^{K)  the  space  C^{K).  More¬ 
over,  the  space  of  equivalence  classes  of  measurable  functions  over  K  equipped 
with  the  topology  of  convergence  in  measure  is  denoted  by  Lo(A').  In  order 

to  define  Wp{K)  for  fc  <  0,  we  introduce  W^{K)  for  each  m  >  0  as  the 
closure  in  the  norm  (21)  of  smooth  functions  /  such  that  supp/  C  intF.  For 
l^P^oo,  fc<0  and  for  g  G  Wp,{K)  put 

||g||W(A:)  =  sup{|  J^fgdx\  :  <  1,  /  €  (22) 

where  p'  =  p/(p  —  1)  for  1  <  p  <  oo  and  1'  =  oo,  oo'  =  1.  Now,  the  completion 
of  Wp,{K)  in  the  norm  (22)  defines  the  space  Wp,{K). 

Let  now  I  =  [0,1],  Q  =  and  let  Z  be  a  boundary  set  i.e.  a  set  which  is  a 
union  of  (d  —  l)-dimensional  faces  of  Q.  To  each  pair  {F{Q),  Z]  we  associate 
a  subspace  of  F'{Q)  of  functions  which  are  vanishing  on  Z  C  dQ  in  the  sense 
described  below.  To  each  Z  there  are  unique  Z,  C  dl,  i  =  1, . . .  ,d,  such  that 

Z  =  Q\(/\Zi)x...x(/\Z,).  (23) 


Now,  define  for  each  Z  the  parallelepiped 


Qz  =  Izi  X  ...  X  Iza 


(24) 
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where 


r[o,i] 

for  Z  =  0, 

[-1,1] 

for  Z  =  {0}, 

1  [0.2] 

for  Z  =  {1}, 

U-1.21 

for  Z  =  {0, 1}. 

(25) 


If  /  G  Lo{Q),  we  denote  by  fz  the  element  of  Lo{Qz)  such  that  fzlq  —  f 
and  fz  on  Qz  \  Q- 

Definition  2.  For  given  integer  k  and  I  <  p  <  oo,  put 


\\f\\^J‘\Q)z  =  ll/zllf  (Oz). 


Now,  if  k  >  0,  define 

w^{Q)z  =  {fe  W^iQ)  :  fz  e  W^iQz)h 
and  ifk<0  then  introducing  Wq  =  {/  €  Wp(Q)  :  fz  €  Wp{Qz)},  deSne 
W'p  (Q)z  =  completion  of  Wq  in  the  norm  \\f\\^\Q)z- 

The  spaces  [Wp{Q)z,  ||  ■  \\^\Q)z]  are  called  standard. 

Notice  that  for  A:  >  0  the  set  {fz  :  f  e  Wp{Q)z}  is  a  closed  subspace 
of  Wp{Qz),  and  by  Definition  2  the  map  f  ^  fz  an  isometry.  Now  let 
k  <  0.  In  this  case  the  map  f  ^  fz  extends  to  an  isometry  of  Wp{Q)z 
into  Wp{Qz)’  Thus  Wp{Q)z  is  always  complete,  and  the  image  of  the  map 
/  ►->  /z  is  a  closed  subspace  of  Wp{Qz)-  We  have  constructed  in  [16],  using 
the  formulae  (23)  and  (24)  and  the  generalized  Hestenes  extension  operators, 
a  bounded  projection  onto  this  subspace. 

Proposition  3.  Let  m  >  1  and  the  boundary  set  Z  C  dQ  be  given.  Then 
there  are  a  continuous  linear  operator  P  in  Lo{Qz)  and  C  <  oo  such  that  P 
projects  Lo(Qz)  onto  {/  :  /  =  0  a.e.  on  Qz  \  Q}  and  for  1  <  p  <  oo  we  have 

IIPflffHQz)  <  C  ll/ll^CSz)  for  f  €  W^iQz),  |fc|  <  ra.  (26) 

Thus,  P  projects  Wp{Qz)  onto  a  subspace  which  is  via  the  map  f  fz 
linearly  isomorphic  to  Wp(Q)z^ 

Now,  for  <  0  and  1  <  p  <  oo  we  define  the  bilinear  form 

g*{f)=  f  fgdx  for  g*  e(W^HQ)z’r,  g&w°{Q).  (27) 

JQ 

Proposition  4.  Let  fc  <  0  and  1  <  p  <  oo  be  given.  Then  the  map 
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defined  in  (27)  extends  to  a  linear  isomorphism  ofWp(Q}z  onto  a  subspace 

Now  suppose  real  s  and  1  <  p,  g  <  oo  are  given.  Moreover,  let  be  a 
proper  set.  Then  for  any  integers  fc,  /  such  that  /  <  s  <  fc,  we  have  the  real 
interpolation  formula  for  the  Besov  space  (with  9  =  {s  -  l)/{k  -  /)) 

=  (28) 

For  /  e  Bp^^{K)  the  norm  is  denoted  by  ||/||p!g(-^^).  The  Besov  space  over  Q 
with  I  =  0  <  s  <  k  and  corresponding  to  the  boundary  set  Z  C  dQ  is  now 
defined  by  the  formula 

B;jQh  =  {/  e  y^piQ) :  fz  e  b;jqz)}.  (29) 

Moreover,  let  us  define  for  f  €  Bp  g{Q)z 

ll/ilj,!^(0)z  =  WhtliQz)'  (30) 


The  Besov  space  [B‘  ij{Q)z,  IHIri!?(<?)z]  will  be  called  standard  as  well.  Notice, 
that  Bp  ,j{Q)z  C  jBp^g(Q),  but  it  may  be  not  a  closed  subset  of  Bp  g{Q). 

Proposition  5.  Let  the  parameters  I,  k,  6,  s,  p,  q  be  given  as  for  (28),  and  let 
J^{Q)z  =  (:fo{Q)z,:Fi{Q)z)s,,  where  Te  =  IV',  T,  =  (31) 


Then,  T{Q)z  =  Bp^^[Q)z  fors>0  and  for  s  <  0  the  space  f{Q)z  is  naturally 
identified  with  the  closure  ofWp  in  (Bp/^,(Q)z')*- 

The  proof  is  based  on  the  existence  of  the  projection  P  in  Proposition  3, 
and  on  the  general  properties  of  the  real  interpolation  spaces  (see  [2,  16]). 

Corollary  6.  Suppose  we  are  given  real  numbers  s,  1  <  p,q  <  oo,  an  integer 
k,  and  a  boundary  set  Z  C  dQ  of  the  cube  Q.  Then  the  standard  spaces 
T{Q)z  are  well  defined  for  T  —  Wp  or  Bp  ^^.  Moreover,  if  I  <  s  <  k,  then 
formula  (31 )  takes  place. 

In  the  last  part  of  Section  3  we  are  going  to  present  a  construction  of 
spline  bases  in  the  P(Q)z  spaces.  Actually,  according  to  (31),  it  is  sufficient 
to  do  it  for  the  Sobolev  spaces  Wp(Q}z’ 

We  start  with  the  case  of  dimension  d  =  1.  To  each  Z  C  dl  and  for  an 
integer  m  =  r-  2>0a  spline  system  is  defined  as  follows: 


FM(1;Z)  = 


(t)  if  Z  =  0  and  n  >  2  -  r, 

^(2r,-r)(j)  jf  2  =  (0, 1}  and  n  >  2  —  r, 
if  Z  =  {0}  and  n  >  1, 
^Pr,-r)(j)  if  2  =  {1}  and  n  >  1; 


(32) 
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where  the  and  are  given  as  in  (10)  and  (14),  respectively.  Moreover, 
let 


n(Z)  =  n{Z^  0)  and  n(Z,  k) 


\k\-\-2-r  if  Z  =  0, 

|A:|  +  2-r  ifZ  =  {0,l}, 
1  if  Z  =  {0}, 

1  if  Z  =  {!}. 


(33) 


For  simplicity  let  us  write  F_^^\Z)  =  ;Z),n  >  n{Z)).  Notice  that 

n(Z,  k)  =  n(Z',  -k),  and  that  the  two  systems  F^‘^\Z)  and  F^"^^(Z'),  where 
Z'  =  dl\  Z,  are  dual,  i.e.  they  are  biorthogonal  in  the  L‘^{I)  scalar  product 


(F^\- ;  Z), ;  Z'))  =  Sij  for  i,j  >  n{Z).  (34) 

Now,  we  introduce  related  family  of  biorthogonal  systems  indexed  by  k  with 
\k\  <  m.  Namely,  for  j  >  n(Z, /j),  let 


jp{m,k) 

^3 


(•;Z)  = 


D'=Fj’"'(- ;Z)  for0<fc<m, 
H-*=Fj”’)(-  ;  Z)  for  -m  <  A;  <  0; 


and  the  biorthogonality  for  |fc|  <  m  is  as  follows 


(35) 


(f;'”’*)(-;Z),F;™'-*'(-;Z'))  =  «M  for  i,j>n{z,k). 


p(m,-k) 


(36) 


Theorem  7.  For  each  Z  C  dl,  1  <  p  <  oo,  the  system  F^”^^(Z)  is  in  W^(I)z 
and  it  is  a  basis  (an  unconditional  basis  if  1  <  p  <  oo)  in  each  Wp(I)z 
for  k  =  This  means  that  it  is  a  simultaneous  basis  (simultaneous 

unconditional  basis  ifl<p<oo)in  ||  • 

Proof;  To  see  how  the  proof  works,  let 


Pnf{x;Z)=  Y,  (f,Fj'"\-;Z'))Fl"‘\x;Z)  for  /  €  ip(/).  (37) 

n(Z)<j<n 

Then  we  find  that  for  0  <  /c  <  m, 

D‘F„/(a:;Z)  =  pW(I>V)(a:;Z)  for  f  (38) 

where  for  g  G  W®  (/) 

pW(s)(x;Z)=  (5, ;Z'))ir(”''=>(a;;Z).  (39) 

j-n{Z,k) 


Now,  Theorem  7  follows  immediately  from  Theorem  1  by  (38)  and  (39).  □ 
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Now  we  consider  the  case  of  dimension  d>  1,  with  Q  =  Suppose  we 
are  given  Z  C  dQ.  Then  according  to  (23)  the  Zi  C  dl,  for  f  =  1, . . . ,  d,  are 
determined.  We  are  ready  now  to  construct  the  tensor  product  basis  corre¬ 
sponding  to  the  boundary  set  Z.  Each  function  of  the  basis  under  construction 
is  determined  by  an  integer  vector  j  =  satisfying  the  inequality 

j  >  n{Z)  with  n{Z)  =  {n{Zi), ,  n{Zd))  i.e.  ji  >  n{Zi)  for  z  =  1, . . . ,  d. 

Given  the  order  of  smoothness  m  >  0,  we  now  define  the  j’s  function  as 
follows: 

Z)  =  Zi)  x  •  ■  •  x  Z^),  (40) 

where  a:  =  (a:i, . . . ,  Xd)-  The  indices  j  are  ordered  in  the  rectangular  way  (cf. 

[12],  p.  221). 

Theorem  8.  The  system  ;  Z),j  >  n{Z))  in  the  rectangular  ordering 

is  a  basis  in  W^{Q)z  for  1  <  p  <  oo,  and  in  addition  it  is  unconditional  in 
these  spaces  if  1  <  p  <  oo. 

Our  next  goal  is  to  modify  the  basis  (40)  in  such  a  way  that  the  ele¬ 
ments  of  the  new  basis  will  be  concentrated  around  the  corresponding  dyadic 
points  in  Q.  To  this  end  let  us  introduce  in  dimension  one  the  following  finite 
dimensional  spline  spaces: 

5“(Z)  =  span{FH(-;Z):  n(Z)<i<2'')  where  p  >  1.  (41) 

Now,  without  going  into  details,  we  accept  the  Definition  10.17  of  [17]  of  the 
new  spline  basis  in  S^{Z),  i.e.  of  ;  Z)  with  n{Z)  <  j  <  2^.  The  new 

basic  functions  for  the  standard  space  W^{Q)z  are  now  defined  as  follows. 
For  convenience,  let  D  =  {1, . . . ,  d},  and  let 

No{Z)  =  {j  :  n{Zi)  <ji<l  for  i  €  D}, 

and  for  every  e  C  D,  e  ^  0,  /x  >  1,  let 

NeAZ)  =  {j  :  2/^"'  <  3i  <  2^  for  i  G  e,  n{Zi)  <  ji  <  2'^-^  for  z  G  D  \  e}. 

We  also  introduce 

N^{Z)=  (J  N,JZ). 

fb^eCD 

Definition  9.  Let 

G'f'\--,Z)  =  Fj"'\--,Z)  for  jeNoiZ), 

and 

G<f'\-;Z)  =  0Fjy{--,Zi)0  0  FjF^(--,Zi)  for  j  e  N,,^{Z) 

iee  ieD\e 


for  any  e  C  D,  e  ^  0,  and  p  >  1. 
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Any  ordering  of  the  set  of  indices  {j  :  j  >  n{Z)}  is  said  to  be  regular  if 
j  -<  j'  for  any  j  G  and  ^  G  whenever  fj.  <  p'  (cf.  [10]).  We  also  have 
the  biorthogonality  relation 

(G^p\--,Z),GP(--,Z'))  =  6yj.  (42) 

We  can  now  state  the  result  on  ’universal  basis’  in  standard  spaces  (cf.  [10, 

16]) 

Theorem  10.  Let  m  >  0  be  a  given  order  of  smoothness.  The  system 
;Z),j  >  n(Z))  in  the  regular  ordering  is  a  basis  in  all  the  spaces 
W'p(Q)z  for  0  <  k  <  m,  1  <  p  <  oo,  and  in  addition  it  is  unconditional  if 
1  <  p  <  oo.  Moreover,  for  p  >  1,  1  <  p  <  oo,  we  have 

II  aj  Gf  >(•  ; Z%  ^  ^  |a,.|*>)'/’’,  (43) 

where  the  constants  in  the  relation  ~  depend  only  on  d  and  m. 

Corollary  11.  The  system  ;  Z),j  >  n{Z))  in  the  regular  ordering  is 

a  basis  in  all  the  spaces  Bp^q(Q)z  with  1  <  p,q  <  oo,  0  <  s  <  m.  Moreover, 
for 

/(•)=  E 

3>n{Z) 

letting  a  =  5/0?  +  1/2  -  1/p,  we  have 

\\f\\i:l{Q)z  -  f  E  E 

The  constants  in  the  relation  depend  on  m,  s  and  on  d. 

For  an  arbitrary  d-cube  Q,  the  function  space  F{Q)z  is  defined  as  the 
image  of  F{I^)z  under  the  linar  mapping  induced  by  the  diffeomorphism 
between  Q  and 

§4.  Decomposition  of  Function  Spaces  over  Smooth  Manifolds 

Let  us  start  with  the  decomposition  of  M  without  boundary  (for  M  with 
boundary  cf.  [16]).  We  say  that  M  admits  decomposition  into  d-cubes  if  for 
some  N  there  are  d-cubes  Qi,...  ,Qn  C  M  such  that  l)j<NQj  =  M  and  if  $j 
is  a  diffeomorphism  of  [0,  l]'^  onto  Qj,  1  <  j  <  N,  then  the  set  $“^(Ui<jQi) 
is  the  union  of  some  (d  -•  l)-dimensional  faces  of  [0,  l]'^.  The  decomposition 
Qi, . . .  Qn,  is  said  to  be  proper  if  the  sets  Ui<jQi  are  proper  for  j  =  0, . . . ,  N. 
Now  we  have  the  following  result  whose  proof  depends  very  much  on  Morse 
theory  (cf.  Theorem  3.3  in  [16]): 

Theorem  12.  Let  M  be  a  compact  d-manifold.  Then  M  admits  a  proper 
decomposition. 
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For  the  Sobolev  Wp{M)  and  Besov  spaces,  we  have  the  real 

interpolation  formula  for  any  integers  I,  r,  real  s,  /  <  s  <  r,  and  for  1  <  p,  7  < 
00  (cf.  [15]) 

(M)),  ,,  (44) 

where  s  =  (1  -  0)/  4-  9r.  We  recall  that  for  the  standard  Besov  spaces,  we 
have  similar  formula  (cf.  Proposition  5).  Having  a  proper  decomposition  of 
M  (Theorem  12),  we  would  like  to  obtain  a  corresponding  decomposition  of 
!F{M)  into  a  direct  sum  of  standard  spaces  T{Qj)zj‘  Let  Lq{M)  denote  the 
space  of  all  measurable  functions  (of  equivalence  classes)  with  the  topology  of 
convergence  in  measure. 

Proposition  13.  Let  Qij-'jQn  be  a  proper  decomposition  of  M  into  d- 
cubes  as  in  Theorem  12.  Let  p  be  a  smooth  measure  on  M.  Then,  for  any 
m  >  1,  one  can  construct  continuous  linear  operators  Pi, ... ,  Pn  ^be  space 
Lq(M)  with  the  following  properties  for  f  G  Lo(M): 

i<N 

PiPjf  =  0,  in<ij^j<N,  (46) 

XQ,Pjf  =  PiXQ,f,  in<i<j<N,  (47) 

there  is  C  <  oo  such  that  for  all  spacer  WpM),  0  <  k  <  m,l  <  p  <  co  ,  for 
all  g  e  WpM)  and  for  1  <i  <  N  we  have 

WPig^HM)  <  C\\g\\f\M),  (48) 

the  adjoint  operators  (in  the  Hilbert  space  L2(M,  p))  P^ , . . . ,  P^  satisfy  the 
analog  of  (48). 

Proposition  13  implies  the  main  result  on  decomposing  the  function  spa¬ 
ces  P{M)  (see  [16]),  i.e. 

Theorem  14.  Let  Qi,- ..  ,Qn  be  a  proper  decomposition  of  M  as  in  Theo¬ 
rem  12,  and  let  Pi,...,  Pn  be  the  linear  operators  from  Proposition  13.  Then 
the  formulae 

Tof  =  Y.  W<Pif^  Vof  =  Y.  XQ<P:f 

i<N  i<N 

dehne  linear  isomorphism  of  Lo(M)  onto  itself,  the  inverse  maps  being,  re¬ 
spectively, 

Sof  =  Y  Uof  =  Y 

i<N  i<N 

Moreover,  if  T  denotes  Wp,  0  <  k  <  m,  1  <  p  <  oo,  then  Tq,  Vq  induce  linear 
topological  isomorphism 

T  :  J'(M)  ^  0  PiQi)z, .  V  :  -  0  ^^(QOz; . 

i<N  i<N 

where  Zf  =  Qi  Ui<j  Qj  and  Z\  =  Qi  Uj>i  Qy 
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Corollary  15.  The  assertions  of  Theorem  14  remain  true  for  T  =  Wp,  Bp  ^ 

with  |fc[  <  m,  jsj  <  m,  1  <  p,q  <  oo.  Moreover,  there  is  now  an  obvious 

extension  of  Theorem  10  and  Corollary  11  to  fF{M)  for  all  these  spaces. 
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A  Note  on  Convolving 
Refinable  Function  Vectors 


C.  Conti  and  K.  Jet  ter 


Abstract.  When  convolving  two  refinable  function  vectors  which  give 
rise  to  convergent  subdivision  schemes,  the  convolved  scheme  is  again  con¬ 
vergent.  Moreover,  the  conditions  on  the  mask  symbols  which  characterize 
the  approximation  order  of  the  associated  shift  invariant  spaces  show  that 
the  order  of  the  convolved  space  is,  essentially,  the  sum  of  the  order  of  the 
two  spaces  originating  from  the  convolution  factors. 


§1.  Kronecker  Convolved  Function  Vectors 

Our  previous  paper  [3]  deals  with  special  subdivision  schemes  associated  with 
a  shift  invariant  space  of  bivariate  spline  functions,  where  the  “generators”  of 
the  shift  invariant  space  are  produced  through  convolving  lower  order  splines 
of  small  support.  The  present  paper  gives  a  more  detailed  and  more  systematic 
analysis  of  this  convolution  process.  In  this  way  it  is  possible  to  prove  that 
(i)  the  convolution  of  convergent  subdivision  schemes  yields  a  scheme  which  is 
convergent  as  well,  and  (ii)  essentially,  the  approximation  power  of  a  convolved 
shift  invariant  space  is  at  least  the  sum  of  the  approximation  powers  of  the 
two  convolution  factors. 

We  recall  that  a  vector  $  =  ,  <^n)^  of  (continuous,  compactly 

supported)  d-variate  functions  is  called  refinable,  if  it  satisfies  a  refinement 
equation 

#  =  x;  ■^(2  •  -«)  •  (1-1) 

Here,  the  refinement  matrix  mask  A  =  is  a  matrix  sequence  with 

each  ‘coefficient’  Aa  being  a  real  (n  x  n)-matrix.  We  allow  only  masks  of  finite 
support,  i.e.,  Aa  =  0  except  for  finitely  many  a  G  7L^. 
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Given  another  refinable  function  vector  ^  =  (V'l, -02,  •  •  •  ?  ^  m  f  of  d- 
variate  functions,  satisfying  the  refinement  equation 

*  =  ^  *(2  • -/?)  (1.2) 


with  corresponding  matrix  mask  B  =  (^p)p^2Z‘^  >  which  is  a  matrix  sequence  of 
(mxm)-matrices  Bp,  we  use  the  following  Kronecker  type  notion  of  convolving 
the  two  vectors: 


0  ^ 


02  *  ^ 


\  0„  *  ^  / 


Here,  the  convolution  of  a  scalar  function  0^  with  the  vector  function 
is  taken  componentwise.  This  operation  produces  a  vector  function  0  = 
(01, 02,  •  •  •  5  OmnY  m  X  n  components  of  type  0(i_l)„^+j  :=  0,  *  0j-  It  is 
not  too  hard  to  see  that  0  is  refinable  again, 


©  =  y]  c.,  ©{2  •  -7)  ,  (1.3) 

7€Z5'‘ 

where  the  refinement  mask  C  =  which  is  a  matrix  sequence  of 

(nm  X  nm)-matrices,  is  computed  as  follows: 

^7  =  4  E  (1.4) 

Here,  the  symbol  (g)  denotes  the  Kronecker  product  of  matrices. 

For  the  definition  and  some  properties  of  this  Kronecker  product,  we 
refer  to  [4,  Section  4.2].  It  should  be  noted  at  least  that  neither  the  Kronecker 
product  of  matrices  nor  the  Kronecker  type  convolution  is  commutative. 


§2.  Convergence  of  the  Convolved  Subdivision  Scheme 

Let  us  also  recall  that  a  refinable  function  vector  gives  rise  to  a  vector- valued 
subdivision  scheme  as  follows:  In  the  situation  of  (1.1),  the  subdivision  operator 
associated  with  the  refinable  function  vector  $  =  (0i,  02,  ■  •  • ,  4>nY  defined  as 

5,4  :  (t(7L'^)T  ^  {t(TL'^)T  . 

(Sa  A)„  :=  E  «  e  (2-1) 

where  denotes  the  linear  space  of  sequences  indexed  by  7L^.  The  com¬ 

plete  (stationary)  subdivision  scheme  consists  in  the  iterates  of  Sa,  namely: 

For  a  given  initial  vector  sequence  A  G 
Put  :=  A  and 

Compute  :=  Sa  fc  =  0, 1, . . . 


(2.2) 
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Hence,  the  iterate  has  components 

:=  E  (4-2'./3)'"  A^“>.  a  6  (2.3) 

Pe'Z‘^ 

with  the  iterated  matrices  =  i^a^)ae7Z'^  defined  by  A^^)  :=  A  and 

AW  :=  ,  ae7L\  for  k  >  1.  (2.4) 

Following  [2,  Section  2,4]  we  say  that  the  subdivision  scheme  converges 
for  A  =  (A^, . . . ,  A")^  €  if  there  exists  a  continuous  function  /a  : 

+  m  such  that 

=  e  =  (2.5) 

n 

Here,  ||  •  ||oo  denotes  the  sup-norm  of  the  vector  sequence  A  =  (A^, . . . ,  A")^ 
given  by 

an  :=  rnax  ||V||oo. 

[I  llcxD 

The  symbol  /a(^)  is  short  for  the  scalar-valued  sequence 

should  be  noted  that,  since  for  any  convergent  scheme  the  limit  function  /a 

is  given  by 

/a=  E  Al  *(■-«), 

we  can  recover  the  components  (/>*,  i  =  1, . . .  ,n,  as  follows:  choose  the  initial 
sequence  A  =  (A^, . . . ,  A”)^  in  the  following  way  that  A"  is  a  delta  sequence 
(ie.,  Aq  =  6a  for  a  G  TL^)  while  all  other  sequences  A-^,  j  ^  i,  are  null 
sequences.  This  special  initial  vector  sequence  and  its  iterates  will  be  denoted 
by  Ei  and  k  >Q,  respectively.  It  then  turns  out  that 

||04^)e-Ef)|L  =  O  for  i  =  (2.6) 

Theorem  2.1.  Given  two  convergent  subdivision  schemes  associated  with  the 
refinable  function  vectors  $  and  the  (Kronecker  type)  convolved  scheme 
associated  with  ©  =  4^  *  ^  is  again  convergent. 

Proof:  Let  f  :=  (11  1)^,  and  let  Fj  denote  the  vector  sequence  (com- 

m 

posed  by  m  sequences)  with  the  delta  sequence  at  position  j  and  the  null 
sequence  at  all  other  positions.  If  k  >  0,  are  the  iterated  vectors  with 
respect  to  the  ’^-subdivision,  we  have 
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in  addition  to  (2.6).  In  order  to  prove  the  theorem,  we  will  show  that 

\jm  I ((0<  *  V-,)(^))(e  ®  f)  -  (E,  »  =  0  (2.8) 

for  ?  =  1, . . . ,  n  and  j  =  1, . . . ,  m.  Here,  the  convolution  of  two  vector  se¬ 
quences  is  defined  by 


where  the  convolution  of  a  scalar  sequence  A*  with  the  vector  sequence  T  is 
taken  componentwise,  i.e.,  A*  *  F  =  (A*  *  7^  A*  +  7^, . . . ,  A*  *  7”^)^.  We  will 
also  use  the  estimate 

||A*r||^  <mm  { l|Al|J|r||^  ,  ||A||^|lr|lj }, 

where  ||A||^  :=  maxi=i^...,„  ||A^||i,  with  the  usual  1-norm  of  scalar  sequences. 

Now,  the  iterated  matrices  for  the  ©-subdivision  can  be  expressed 
by  the  iterated  matrices  and  for  the  and  ’^'-schemes  as  follows: 

^  E  e  (2.9) 

Thus,  taking  Ei  +  Fj  as  a  starting  vector  for  the  ©-subdivision,  the  iterated 
vectors  are  given  by 

=  fc>0,  (2.10) 

whence 

=  *'Ai)(^))(e®  f)  -  *  (V'i(^)f) 

+  i  -  Ff ) 

We  estimate  the  three  vector  sequences  in  the  preceding  three  lines  as 
follows:  the  first  term  is  a  vector  sequence  where  each  component  of  the 
vector  consists  of  the  sequence 
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This  is  the  error  of  a  tensor  product  rectangular  rule  applied  to  the  convolution 
integral,  and  due  to  the  continuity  and  the  compact  support  of  (j)i  and  we 
get 

lim  sup  |e^*^|=0. 

For  the  third  term, 


1 

2d-k 


and  the  bound  tends  to  zero  as  k  oo,  since 


.(^)f 


ae’Z<^ 


Finally,  for  the  second  term 

-  Ff  0  \l  <  c.  -  Ff\l  -  0 


2^‘ 

as  fc  — )•  oo.  This  follows  from 


Cfe 


(fc)| 


<  -rr 
1  -  2^-^ 


(fc)| 


2d-k 


2fe> 


since  the  first  term  on  the  right-hand  side  converges  to  \(l)i{x)\dx,  and  the 
second  term  is  bounded  due  to  (2.6),  the  uniform  continuity  of  the  compactly 
supported  function  (pi,  and  the  fact  that  for  some  compact  set  K  C  (in 
fact  we  can  take  the  support  oi  (pi): 


(Ef))„  =  0  ior^iK. 


This  latter  property  is  a  consequence  of  the  compact  support  of  the  matrix 
mask  A,  since  we  start  the  iteration  with  a  ‘delta’-sequence  Ej.  In  conclusion, 
(2.8)  holds  true.  □ 


§3.  Approximation  Order 

Approximation  orders  of  shift  invariant  spaces  have  been  studied  quite  inten¬ 
sively  in  the  past  few  years.  Concerning  definitions  and  notation  we  refer  to 
the  recent  survey  [5].  A  characterization  of  approximation  power  in  terms  of 
the  mask  symbol  is  also  given  there.  In  case  of  the  refinement  equation  (1.1), 
this  mask  symbol  is  defined  by 


o.eE'' 


(3.1) 


140 


C.  Conti  and  K.  Jetter 


The  result  is  as  follows:  For  given  A:  E  IN  we  say  that  satisfies  condition 
(Zk)  if  there  exists  a  row  vector  v  =  (ri, . . . ,  r„)  of  trigonometric  polynomials 
such  that 


v(0)4(0)#0,  v(0)H*(0)  =  v(0),  (3.2,a) 


and 


D''(v(-)H4.(-))|^  =  0  for  |/i|  <  A:  and /?  e  (3.2.5) 


where  Eq  is  short  for  the  set  of  corners  of  the  cube  [0,1]^  with  the  origin 
removed.  [5,  Theorem  3.2. 8. (i)]  then  asserts  that  condition  (Zk)  implies  that 
5$  has  L2-  approximation  order  k,  for  f  €  VF* 


Lemma  3.1.  If  the  mask  symbol  satisfies  condition  (Zk)  (with  the  row 
vector  v)  and  the  mask  symbol  satisfies  condition  (Zf)  (with  the  row  vec¬ 
tor  w )  then  the  mask  symbol  He  of  the  convolved  function  vector  Q  ^ 
satisfies  condition  Zk+e  (with  the  row  vector  z  :=  v  <S>  w). 


Proof:  We  apply  the  identity 


He  =  H^  (S)  H^ 


(3.3) 


several  times.  Condition  (3. 2. a)  holds  for  the  convolved  function  vector,  since 
(v(g)  w)(0)  (#T^)(0)  =  v(0)4(0)w(0)^(0)  #  0, 


and  by  (3.3), 

z(0)He(0)  =  (v(0)H$(0))  ®  (w{0)H^(0))  =  v{0)  C)  w(0)  =  z(0). 

In  order  to  verify  condition  (3.2.6), 

D"  (z(-)Hq(-))  1^=0  for  |/i|  <  *)  +  f  and  /3  e  £(,, 
we  make  use  of  the  Leibniz-type  formula 
I>'‘((v®w)(-)(H*®Hi,)(-)) 

=  E  (!!)  (''(■)H4.(2))  ®  •D"'"  (w(.)H^,(-)) 

at  every  point  jS  e  Eq.  □ 

In  order  to  derive  an  approximation  order  result  from  this  lemma,  we  refer 
to  the  precise  statement  of  [5,  Theorem  3.2.8].  A  sample  result  is  as  follows: 
If  the  shift  invariant  spaces  associated  with  $  and  have  approximation 
orders  k  and  f,  respectively,  and  if  the  Gramians  G^,  and  satisfy  the 
regularity  condition  given  in  [5,  Theorem  3.2.8.(ii)j,  then  the  ^convolved’  shift 
invariant  space  has  (at  least)  approximation  order  k  T  f.  However,  in  general 
the  ‘convolved’  Gramian  G©  does  not  satisfy  this  regularity  condition. 
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§4.  An  Example;  Bivariate  Cubics  on  a  3-directional  Mesh 

In  [3],  we  have  given  an  example  of  piecewise  C^-cubics  on  the  four-directional 
mesh.  Following  [1],  we  consider  piecewise  C7^-cubics  on  the  three-directional 
mesh  generated  by  the  lines  x  =  k,  y  =  I,  with  k,l  £  TL  when  adding  the 
diagonals  x—y  =  m,m^7L.  A  basis  of  this  space  is  given  by  the  two  functions 
^1  =  Biii*xTi  and  62  —  with  Bm  the  linear  three-directional  box- 

spline  (or  “Courant”  element),  and  xti  >  XTa  fhe  characteristic  functions  of  the 
two  triangles  Ti,T2  obtained  by  cutting  the  unit  square  [0, 1]^  by  the  ‘north¬ 
east’  diagonal.  Thus,  ©  =  $  +  ^  with  :=  (Bm)  and  :=  (xT^XTa)^- 
Now,  #  =  (Bill)  satisfies  a  scalar  refinement  equation  (1.1)  with  refine¬ 
ment  mask 

/  :  :  :  :  ; 

•••0  0  0  0  0 

•••  0  0  i  i  0 

•A  =  =  •  •  •  0  i  1  I  0 

•••  0  1  i  0  0 

•••0  0  0  0  0 

and  ^  =  (xTuXTi)^  satisfies  the  refinement  equation  (1.2)  with  matrix  maisk 


Here,  the  indexing  of  the  ‘coefficients’  is  such  that  the  boldface  entry  is  at 
position  a  =  (0, 0). 

It  follows  that  0  =  satisfies  the  refinement  equation  (1.3),  with  the 
matrix  mask  C  as  displayed  on  the  following  page,  and  Theorem  2.1  provides 
the  convergence  property  for  the  associated  subdivision  scheme. 

Concerning  approximation  order.  Lemma  3.1  can  be  applied  by  putting 
k  =  2,  £  =  1^  and  z  :=  (1)  (1  1).  As  a  consequence,  the  approximation  order 

for  bivariate  C^-cubics  on  the  the  three-directional  mesh  is  at  least  3,  and  this 
is  the  precise  approximation  order  (as  was  proved  in  [1]). 
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Interpolating  Polynomial  Macro-Elements 
with  Tension  Properties 


Paolo  Costantini  and  Carla  Manni 


Abstract.  In  this  paper  we  present  the  construction  of  nine-parameter 
polynomial  macro  elements,  based  on  the  classical  Powell-Sabin  split, 
which  can  be  connected  to  form  a  surface.  Variable  degrees,  which 
act  as  independent  tension  parameters,  are  associated  with  any  vertex  of 
the  triangulation,  i.e.  to  any  interpolation  point. 


§1.  Introduction 

Among  the  several  approaches  for  avoiding  extraneous  inflection  points  in 
interpolating  functions,  the  so  called  tension  methods  are  the  oldest  and  prob¬ 
ably  the  most  famous  ones.  Basically  they  consist  oi  k  >  1,  piecewise 
functions  depending  on  a  set  of  parameters  which  are  selected  in  a  local  or 
global  way  to  control  the  shape  of  the  interpolants,  stretching  their  patches 
between  data  points. 

Piecewise  polynomial  splines  with  variable  degrees  have  turned  out  to  be  a 
useful  alternative  to  classical  (exponential  or  rational)  tension  methods,  and 
have  been  successfully  applied  both  in  free-form  design  and  in  constrained 
interpolation  of  spatial  data  [1].  On  the  other  hand,  a  limited  number  of 
methods  for  constrained  interpolation  of  bivariate  scattered  data  are  at  present 
available,  and  very  few  of  them  off'er  the  possibility  of  controlling  the  shape 
via  tension  parameters  (see  for  example  [2,3]  and  references  quoted  therein). 

Let  a  set  of  scattered  points  {xi,yi,fi),  i  =  be  given,  and 

suppose  they  have  been  associated  with  a  proper  triangulation  T.  We  are 
interested  in  local  methods  where  the  polynomial  pieces  of  a  spline  are  de¬ 
termined  one  triangle  at  time  using  only  local  data.  Such  methods  are  called 
macro-element  methods.  The  aim  of  the  paper  is  to  describe  a  new  class  of 
variable  degree  polynomial  triangular  macro-elements,  and  to  show  their  ten¬ 
sion  properties.  These  polynomial  patches,  which  are  based  on  the  classical 
Powell-Sabin  split  of  a  triangle,  can  be  connected  to  form  a  interpolating 
function. 
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Typically,  the  main  drawback  of  macro- element  methods  is  in  the  strong 
influence  of  the  triangulation  on  the  shape  of  the  interpolating  surface,  and, 
as  far  as  we  know,  it  is  still  not  clear  how  to  construct  a  “good”  triangulation. 
Given  that  any  interpolating  scheme  based  on  triangular  macro-elements  can¬ 
not  be  completely  independent  of  the  triangulation,  we  can  nevertheless  try 
to  reduce  this  dependence.  From  this  point  of  view,  the  main  advantage  over 
existing  methods  is  in  the  fact  that  we  have  a  variable  degree  associated  to 
each  vertex  of  the  triangulation;  therefore  the  modification  in  the  shape  of 
the  interpolant  is  similar  for  all  the  patches  around  the  same  common  vertex 
(see  Fig.  8  right).  In  other  words,  although  constructed  over  T,  the  tension 
parameters  are  more  related  to  the  interpolation  points  than  to  the  edges  of 
the  adopted  triangulation. 

It  is  worthwhile  to  anticipate  that  the  possibly  high  degrees  we  use  for  our 
construction  act  only  as  tension  parameters,  and  do  not  modify  the  basic  struc¬ 
ture  of  the  macro-element.  In  other  words,  we  always  use  a  nine-parameter 
macro-element,  and  the  computational  complexity  is  almost  independent  of 
the  size  of  the  degrees  used. 

The  scheme,  being  local,  requires  that  the  gradients  are  also  known  at  the 
vertices  of  the  triangulation.  If  this  information  is  unavailable,  gradients  can 
be  recovered  from  the  data  points.  In  the  presented  numerical  test,  gradients 
have  been  computed  from  the  data  according  to  the  classical  least  square 
strategy. 

The  paper  is  divided  into  six  sections.  In  the  next  section  we  introduce 
some  notations.  Sections  3  and  4  are  devoted  to  the  construction  of  the  control 
points  defining  the  macro-element,  whose  properties  are  briefly  discussed  in 
Section  5.  We  end  with  Section  6  where  a  graphical  example  is  presented. 

§2.  Notation  and  Preliminaries 

In  this  section  we  introduce  some  notation.  To  aid  in  comprehension,  we 
notice  that  points  and  vectors  in  IR^  and  in  have  been  denoted  by  bold¬ 
faced  characters  unless  classical  notations  (as  for  gradients)  have  been  used. 
As  usual,  we  describe  the  polynomial  macro-element  in  terms  of  its  Bernstein- 
Bezier  control  points.  Let  P^,  r  =  1, 2, 3,  be  three  non-collinear  points  in  IR^, 
and  let  T  denote  the  triangle  they  form.  An  n-degree  Bernstein  polynomial 
has  the  form 


h{x^y\n)  =  b{u,v,vj;n)  := 


E 

i+j-}-k=n 


n\ 


Ujk 


where,  setting  P  =  [a;,  e  u  =  u{x,y),  v  =  v(x^y),  w  —  w{x^y),  are 
the  barycentric  coordinates  of  P  with  respect  to  the  vertices  of  T,  that  is 

P  =  wPi  +  vP2  +  wPa;  u  +  V  w  =  I, 


and  lijk,  i  -I-  i  =  n,  are  the  Bernstein  ordinates  of  6(., ., .;  n). 
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Fig.  1.  Powell-Sabin  split  of  a  triangle. 


Setting  X  =  x{u^v,w)^  y  =  y{u,v,w),  the  points  in  IR^ 
Ujk 


j)  ^  >  0  )  2  j  +  A:  =  n, 


are  called  control  points  [4]. 

We  finally  recall  the  so-called  Powell-Sabin  split  ([5])  of  a  given  triangle, 
T,  which  consists  of  dividing  T  :=  P1P2P3  into  six  mini-triangles  (Fig.  1): 

7(1.0)  _  P2M1R,  :=  P3M2R,  :=  P1M3R, 

7(1,1)  _  M1P3R,  :=  M2P1R,  :=  M3P2R, 


where 

R  =  /?iPi  -I-  P2P2  +  ^3^3  j  /^1  +  /32  +  /?3  =  1, 

is  a  point  internal  to  T  and 


h/If  —  (1  —  Q:i)P i+i  +  tt^P t+2?  0  <  Clj  <  1, 

is  a  point  internal  to  the  edge  of  T  opposite  to  P^.  Here,  and  in  the  following, 
indices  will  be  considered  modulus  3. 

We  will  denote  by  ,  LiJ*  the  control  points  of  Bernstein  polynomi¬ 
als  in  the  mini  triangle  (see  below).  Let  the  data 

(Pi,/i  =  /(P<),v/i  =  v/(P^)),PieR^  i  =  1,2,3,  (1) 

be  given.  As  mentioned  in  the  introduction,  our  goal  is  to  construct  a 
polynomial  macro-element  on  T  interpolating  the  data  and  having  tension 
properties. 

The  macro-element  will  be  obtained  considering  a  Powell-Sabin  split  of 
r,  and  constructing  in  each  mini  triangle  T^p>^^  a  Bernstein  polynomial  via 
suitable  control  points,  To  obtain  the  final  control  points  (FCP) 

we  follow  two  basic  steps:  first  we  construct  the  basic  control  points  (BCP) 

,  then  we  modify  them  to  reach  the  required  smoothness  of  the  macro¬ 
element. 


146 


P.  Costantini  and  C.  Manni 


§3.  Defining  the  Basic  Control  Points 

For  each  vertex  P^,  let  us  consider  an  associated  given  degree 

‘^i  >3,  Hi  e  IN. 


Let  us  now  describe  the  construction  of  the  BCP  considering  for  the  sake  of 
simplicity  only  the  mini  triangle  See  Fig.  2  for  the  role  of  indices. 

First  of  all  we  assume  interpolation  conditions  (for  the  position  and  the 
gradient)  at  Pi  (see  in  Fig.  3  left)  that  is 


^  ’  ni 

In  order  to  define  the  BCP  around  M3,  let  us  consider  the  univariate  piecewise 
linear  function,  /s,  defined  along  the  edge  P1P2  having  breakpoints  at 

Pi,  P1  +  -P1M3,  P2-— M3P2,  P2, 

ni  712 


interpolating  /  and  its  derivatives  at  the  extremes  of  the  edge.  We  define  (see 
“(g)”  in  Fig.  3  left) 


C?,o  :=  i3(M3), 

:=  '3  (m3  -  ^PiM3)  , 

C-1.1  '3(M3)  + 

’  ’  Til 


(3) 


where 

di  :=  ((1  -  a3)V/(Pi)  +  a3V/(P2),  M3R) . 

Moreover,  we  require  that  i  (see  “0”  in  Fig.  3  left)  belongs  to  the 

plane  through 

j  (3,0)  j  (3,0)  j  (3,0) 

^l,ni-l,0>  ^0,ni,05  ^0,ni-l,l- 

Concerning  the  remaining  control  points,  we  assume  that  the  not  yet  defined 
control  points  of  the  first  two  rows  in  the  mini  triangle  parallel  to  P1P2  (see 
“o”  in  Fig.  3  left)  belong  to  the  straight  lines  through  the  above  defined  control 
points,  that  is 


,(3,0)  _  ^  -  1  T  (3,0)  ,  (^1  -  2)  -  (7  -  1)t  (3,0) 

Hni-i,0  l,ni-l,0  » 

T  (3,0)  _  ^  1  T  (3,0)  ,  (^1  -  2)  -  {i  -  1)  ^  (3,0) 

i  =ni  -  2, ...  ,2  . 


(4) 
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Fig,  2.  The  role  of  indices  of  the  control  points  (projection  onto  the  x,t/  plane) 
for  the  split  of  Figure  1  with  ni  =  5,  n2  =  3,  ns  =  8. 


Fig.  3.  The  construction  of  the  BCP  for  the  split  of  Figure  1  with  nj  =  5,  n2  = 
3,  ns  =  8.  Left:  projection  of  in  the  x,y  plane;  are  deter¬ 

mined  via  interpolation  conditions  (2);  “0”  by  (3);  “o”  by  (4)  and  “o” 
by  coplanarity  conditions.  Right:  p  =  1,2,  3,  g  =  0, 1.  Note  the 

discontinuities  across  the  edges  MjR. 


Moreover,  we  require  that  conditions  for  continuity  across  the  edge  PiR 
hold  [4].  Then,  in  particular, 


{2, 


L 


(2,1) 

0,ni-l,l> 


L 


(2,1) 

0,ni-2,2 


T  (3, 

■^ni-2,0,25 


j  (3,0)  j  (3,0) 

^ni-l,0,lJ-^ni-2,l,l) 


lie  onto  the  same  plane. 
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Fig.  4.  Left:  the  control  points  after  the  first  step,  n  ~  8.  Right:  The 


final  control  points  after  the  second  step.  as  in  Fig.  3. 


(p.<7) 


Finally,  the  central  control  points  fc  >  2,  (see  “o”  in  Fig.  3  left) 

are  assumed  to  lie  onto  the  plane  through 

j  (3,0)  ,  (1,0)  j  (2,0) 

•^ni-2,0,2>  ^n2-2,0,2?  ^713-2,0, 2- 

Similarly  we  define  the  BCP  in  the  other  mini  triangles. 

The  above  construction  provides  a  polynomial  macro-element  which 
turns  out  to  be  of  class  across  the  interior  edges  PfR. 

Moreover,  we  assume  that  the  points  M  lying  on  the  internal  edges  of  the 
initial  triangulation  T  also  lie  on  the  straight  lines  joining  the  R  points  of  the 
triangles  which  those  edges  separate  (see  Fig.  8).  This  classical  requirement, 
(2),  (3),  (4),  and  the  geometry  of  the  Powell-Sabin  split  ensure  continuity 
of  two  macro-elements  across  the  boundary  edge  PiPi-}-i  (see  Fig.  3  right). 

On  the  other  hand,  the  BCP  do  not  produce  in  general  a  continuous 
macro-element  across  the  edges  MjR  unless  the  degrees  nj  are  equal  (see 
Fig.  3  right).  In  order  to  obtain  a  macro-element  without  imposing  any 
conditon  on  the  degrees,  we  modify  the  constructed  BCP.  This  will  be  de¬ 
scribed  in  the  next  section. 

§4.  Obtaining  a  Macro  Element 

In  this  section  we  describe  how  to  modify  the  BCP  to  obtain  the  final  control 
points  (FCP)  producing  a  macro-element.  The  modified  FCP  will  be 
basically  obtained  from  the  BCP  via  the  degree-raising  process  in  two  steps. 
As  a  first  step,  for  each  mini  triangle  let  us  compute 

>  b i,  ^  >  0  ,  i  +  i  -f  A:  =  n, 

the  control  points  obtained  from  z,  j.  A:  >  0  ,  i  +  j  +  k  =  rip+^+i,  by 

the  degree-raising  process  ([4])  from  the  degree  (that  is  the  degree 

associated  with  the  mini  triangle  to  the  degree  n  :=  max{ni, n2, 713} 

(see  Fig.  4  left).  The  control  points  allow  us  to  express  the  polynomial 

of  degree  defined  by  ^  Bernstein  polynomial  of  degree  n. 
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We  emphasize  that,  due  to  the  geometry  of  the  split,  the  control  points 
define  a  macro-element  which  is  of  class  across  the  interior  edges 
MpR  if  and  only  if  the  control  points 


=  0,  •  •  •  ,  n  —  1, 


are  collinear.  Moreover,  we  notice  that,  due  to  the  construction  of  the  BCP 
around  Mp  and  near  R  and  to  the  properties  of  the  degree-raising  process, 
the  control  points 


-(p,0)  =r-(p,0)  _  =r-(p,l)  :i-(p,l) 

^l,n-l-fc,fc5  ^0,n-A:,fc  — -^n-fc.O.fcJ  -^n-fc-l.l.fcs 


fc  =  0,  l,n  —  1, 


lie  on  the  same  plane,  then  they  are  collinear  due  to  the  geometry  of  the  split 
(see  Fig.  4  left).  Then  in  order  to  obtain  continuity  across  the  edge  MpR, 
we  simply  consider  a  second  step  in  which  we  modify 


i^(p.o)  T-(p.i) 

^0,n-fc,jk)^n-A:,0,fc> 


k  =  2, '  •  •  ,n  —  2,  p  =  1, 2, 3, 


imposing  that  (see  Fig.  4  right)  they  lie  on  the  segment  through 


(5) 


A— l,fc’  A  — 1,1, A) 


fc  =  2,-- 


n-2,  p  =  1,2,3. 


§5.  Properties  of  the  Macro— Element 

In  this  section  we  analyze  the  interpolation,  smoothness  and  tension  properties 
of  the  macro-element  defined  by  the  FCP  constructed  in  Section  4.  First  of  all, 
we  notice  that  the  construction  of  the  macro-element  is  completely  local:  it 
only  depends  on  the  data  at  the  vertices  of  T  and  on  the  degrees  nj  associated 
with  the  vertices  which  are  given  input  parameters. 

Theorem  1.  The  polynomial  macro-element  deSned  by  the  control  points 

^i,j,A  ?  P  “  1)  2, 3,  q'  =  0, 1,  i,j,k^0,  i  -i-  j  -i-  k  =  n, 

interpolates  the  data  (1)  and  is  of  class  C'^(T).  Moreover,  let  T  be  a  given 
triangulation  equipped  with  a  classical  Powell-Sabin  split,  and  with  a  fixed 
degree  associated  with  any  vertex.  Then  the  collection  of  the  macro-elements 
corresponding  to  the  triangles  of  T  produces  an  interpolating  surface  of  class 

CK 

Proof:  The  BCP  defined  in  Section  3  produce  an  interpolating  macro¬ 
element  which  is  of  class  across  the  interior  edges  P^R  and  the  boundary 
edges  PiPj^-i.  After  the  degree-raising  process,  only  the  control  points  in 
(5)  are  modified  in  order  to  obtain  continuity  across  MjR.  Since  the 
continuity  across  one  edge  only  depends  on  the  control  points  lying  on  the 
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m  =  n2  =  na  =  3 


m  =  712  =  3,  ”3  =  6 


ni  =  3,  n2  =  ”3  =  6 


”1  =  ”2  =  ”3  =  15 


Fig.  5.  The  local  tension  effect  of  the  degrees:  vertices  are  numbered  counter¬ 
clockwise  from  the  origin. 

first  two  rows  parallel  to  that  edge  ([4]),  the  second  step  in  Section  4  does  not 
affect  continuity  across  PjR  and  PiPj+i.  □ 

As  mentioned  before,  the  degrees  Uj  are  free  input  parameters.  From 
the  construction  described  in  Section  3,  it  is  clear  that  increasing  their  val¬ 
ues  causes  the  BCP  to  approach  the  plane  interpolating  (Pi,/i),  i  —  1,2,3. 
Similarly,  the  FCP  approach  the  same  plane  because  they  have  been  obtained 
via  a  degree-raising  process,  that  is  via  a  convex  combination  of  the  BCP. 
Therefore,  the  same  property  is  shared  by  the  macro-element,  due  to  the  con¬ 
vex  hull  property  of  the  Bernstein  representation.  We  summarize  the  tension 
properties  of  our  macro-element  with  the  following  theorem. 

Theorem  2.  If  ni, 712,723  — >  -foo,  then  the  polynomial  macro-element  de- 
fined  by  the  control  points 

p=l,2,3,q  =  0,1,  ij,k  >  0  ,  i  +  j  +  k  =  n, 

approaches  the  plane  through  (Pj,  /i),  z  =  1, 2, 3. 

We  end  this  section  by  emphasizing  that  the  degrees  act  as  local  tension 
parameters:  each  degree  affects  the  shape  of  the  interpolating  surface  only 
around  the  associated  vertex  (see  Fig.  8  right),  and,  as  previously  said,  this  is 
the  main  feature  of  this  method.  The  increase  of  a  degree  pushes  the  surface 
to  the  piecewise  linear  interpolant  around  the  corresponding  point,  giving  it 
a  cuspidal  appearance.  This  local  tension  effect  is  clearly  shown  in  Fig.  5. 
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§6.  Numerical  Results 

In  this  section  we  present  a  classical  graphical  example  to  show  the  perfor¬ 
mances  of  the  macro-element  and  the  local  tension  effect  of  the  degrees. 

The  data  (Fig.  6  left)  are  taken  from  [6]  (see  also  Fig.  8  left  for  the  consid¬ 
ered  triangulation).  The  interpolating  surfaces  obtained  by  using  the  proposed 
macro-element  with  uniform  degrees  nj  =  3  is  depicted  in  Figure  6  right.  Fig¬ 
ure  7  left  shows  the  interpolating  surfaces  obtained  by  using  uniform  degrees 
Uj  =9.  The  tension  effect  due  to  the  increased  values  of  the  degrees  is  evi¬ 
dent,  but  obviously  uniformly  distributed  over  all  of  the  surface.  On  the  other 
hand,  the  surface  in  Figure  7  right  has  been  obtained  considering  all  the  de¬ 
grees  equal  to  3,  except  those  associated  with  the  five  vertices  as  depicted  in 
Figure  8  left.  The  local  tension  effect  of  the  degrees  is  clear.  The  influence 
region  of  the  increased  degrees  can  be  also  seen  in  Figure  8  right. 
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Quantized  Frame  Decompositions 

M.  Craizer,  D.  A.  Fonini,  Jr.,  and  E.  A.  B.  da  Silva 


Abstract.  In  this  paper,  we  consider  a  certain  type  of  decomposition 
of  vectors  in  frames,  in  which  the  coefficients  are  already  quantized  and 
thus  are  ready  for  coding.  This  decomposition  is  a  generalization  for  vec¬ 
tors  of  the  usual  binary  expansion  of  real  numbers,  and  the  algorithm  for 
obtaining  it  can  be  seen  as  a  quantized  version  of  the  matching  pursuits  al¬ 
gorithm.  We  show  that,  in  several  cases,  applying  this  algorithm  is  better 
than  first  finding  the  frame  coefficients  and  then  quantizing  them. 


§1.  Introduction 

Let  T  —  {ci,  62, . . . ,  Cp}  be  a  collection  of  unit  vectors  generating  This 
means  that  every  x  G  can  be  expressed  as 

V 

X  =  Y^aiei. 

1=1 

The  vectors  {ci,  62, . . . ,  Cp}  may  or  may  not  be  linearly  independent.  In  the 
case  that  they  are  linearly  dependent,  the  set  T  is  called  a  frame  or  an  over¬ 
complete  basis.  In  this  paper,  we  shall  call  J-  a  frame  even  if  the  vectors 
{ei,  62, . . . ,  6p}  are  linearly  independent.  More  on  frame  expansions  can  be 
found  in  [7]. 

Let  q  =  2p  and,  for  1  <  i  <  p,  let  Vi  =  6*  and  Vj+p  =  — 6^.  We  shall  call 
the  set  V  =  {vi, . . . ,  Vg}  a  codebook  or  a  dictionary.  Let  a  be  a  real  number 
in  the  interval  (0, 1).  A  representation  of  a  vector  x  G  in  the  form 

CX3 

X  = 

i=0 

with  Vki  G  X>,  will  be  called  an  (a,  X>)-expansion.  When  the  dictionary  being 
used  is  clear  by  the  context,  we  shall  call  this  representation  simply  an  a- 
expansion.  Observe  that  the  (a,  X>)-expansion  of  a  vector  x  can  be  seen  as  a 
decomposition  of  x  in  the  frame  T. 
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Define  the  n~  residual  of  a  vector  x  by 

f  =  ifn  =  0, 

I  a;  _  a'vk, ,  if  n  >  0. 


Given  x,  the  sequence  {ko,ki,. . .)  can  be  obtained  recursively  by  the  relation 


{rn{x),Vk^)  =  max(r„(a;),Vfc) . 
k 

We  shall  call  this  algorithm  the  nearest  point  algorithm  and  it  may  be  seen  as 
a  quantized  version  of  the  matching  pursuits  algorithm  [4] . 

Denote  by  =  Aa{T>)  the  set  of  points  of  IR^  that  can  be  represented 
as  an  a-expansion  of  vectors  that  belong  to  X>,  and  by  A®  =  A^(X>)  the  subset 
of  Aa{V)  whose  a-expansion  can  be  obtained  by  the  nearest  point  algorithm. 
In  order  for  the  a-expansion  or  the  nearest  point  algorithm  to  be  a  suitable 
scheme  for  quantized  frame  decomposition,  we  must  choose  a  such  that  Aq 
or  A^,  respectively,  contain  an  open  set  of  IR^.  In  Section  2,  we  shall  give 
conditions  on  a  that  guarantee  these  facts. 

At  this  point,  a  question  arises:  is  it  worthwhile  to  decompose  a  vector 
in  a  frame  using  the  a-expansion,  or  is  it  better  to  decompose  it  in  the  usual 
way  and  then  quantize  the  coefficients  in  a  second  step  (see  [6,9])?  We  shall 
answer  this  question  by  considering  the  rate-distortion  characteristic  of  each 
scheme.  We  show  in  Section  3  that  the  first  scheme  is  better,  in  an  asymptotic 
sense,  if  and  only  if  we  can  choose  a  satisfying 

<p. 

We  shall  also  give  examples  where  this  inequality  holds. 

Take  a:  =  (xi, . . .  ,xn)  G  We  can  quantize  x  by  taking  the  n-term 
binary  representation  of  each  coordinate  Xj .  This  procedure  can  be  considered 
as  an  n-term  a-expansion  using  the  dictionary  Bn  whose  code  vectors  are 
the  corners  of  the  hypercube  [-1,1]^  and  a  =  |.  So  the  a-expansion  in  an 
arbitrary  dictionary  V  can  be  considered  as  a  generalization  of  the  usual  binary 
expansion  for  vectors.  The  relevant  question  is  whether  there  is  any  dictionary 
V  that  is  better  in  some  sense  than  Bn-  It  is  worthy  of  note  that  some  special 
dictionaries,  related  to  the  sphere  packing  problem  [2],  have  already  been  used 
for  image  coding,  yielding  better  results  than  Bn  ([3,8]). 


log2(2p) 


§2.  Theory  of  Alpha- Expansions 
General  representation 

Let  V  —  Vg}  be  a  collection  of  vectors  that  generates  all  IR^,  and 

0  <  a  <  1  be  a  parameter.  Denote  by  A^  the  set  of  points  x  G  that  can 
be  written  as 

oo 

X  =  ^a*Vfc^ 

t=0 
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with  Vki  G  D.  Let  be  the  convex  hull  of  the  vectors  { 

Observe  that  any  a:  G  is  a  linear  combination  of  the  vectors  {vA:}fe=i,...,g 
with  coefficients  whose  sum  is  not  larger  than  and  therefore  Aa  C  Pa- 
Define  the  contracting  maps  fk  =  fk,a  by  fk{x)  =  ax  +  for  k  G 
{1, . . .  ,g}.  We  observe  that  fk  is  a  homotety  of  center  j—Vjfc,  which  implies 
that  fkiPa)  C  Pa-  Therefore  the  set  forms  an  iterated  function 

system  (IFS)  [1]  on  Pa-  It  is  not  difficult  to  show  that  the  attractor  of  this 
system  is  exactly  A^,  i.e., 

oo 

A„  =  fl 

n—0 

where  F  =  Fa  is  the  function  of  sets  defined  by 

F(A)  =  /i(A)U...U/,(A). 

Example  1.  Let  V  =  the  dictionary  whose  code  vectors  are  the  corners 
of  the  hypercube  [—1,1]^.  For  any  a  >  we  have  that  F(Pa)  =  Pa  and 
therefore  Aa  —  Pa - 

We  are  interested  in  finding  the  smallest  value  of  a  such  that  Aa  contains 
an  open  set.  In  the  above  example,  this  occurs  for  a  =  when  in  fact 
Aa  =  Pa - 

Remark  1.  One  can  show  that  for  any  dictionary,  if  a  >  then  Aa  =  Pa  ^ 
which  shows  that  the  smallest  value  of  a  such  that  Aa  contains  an  open  set 
is  smaller  than  . 

In  Example  1,  the  smallest  a  such  that  Aa  contains  an  open  set  satisfies 
also  Aa  —  Pa-  But  this  is  not  a  general  fact,  as  the  following  example  shows. 

Example  2.  Let  V  =  B3U  {(1,0, 0),(— 1,0,0)},  where  Bz  is  the  dictionary 
whose  code  vectors  are  the  corners  of  the  cube  [—1,1]^.  If  we  consider  a  = 
then  [-1,1]^  C  Aa,  but  Aa  is  strictly  contained  in  Pa-  This  fact  can 
be  seen  by  observing  that  the  centroid  of  the  face  of  Pa  whose  vertices  are 
(1,0, 0),  :^(1, 1)  1)  and  :^(l5 1,  -1)  are  not  contained  in  F{Pa),  which  implies 
that  F{Pa)  Pa^  and  thus  Aa  ^Pa¬ 
in  all  examples  that  we  have  considered,  we  observe  that  if  Aa  contains 
an  open  set  of  then  it  also  contains  the  convex  hull  of  some  of  the  points 
of  the  dictionary.  We  don’t  know  whether  this  is  always  true,  so  we  formulate 
it  as  a  question: 

Question  1.  If,  for  some  0  <  a  <  1,  Aa(X>)  contains  an  open  set  of  R^,  then 
will  a  subdictionary  t>i  CV  always  exist  such  that  Aa{D)  2  Pa{Di)? 
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Basic  algorithm 

How  do  we  obtain  the  sequence  of  indexes  (fco,  •  -)  that  represent  a  given 
vector  X  G  Aa?  In  general,  the  representation  of  a  vector  x  is  not  unique.  In 
order  to  define  which  of  the  sequences  representing  the  vector  x  we  shall  look 
for,  we  consider  a  choice  function  K  :  F{Pa)  {1, . . . ,  qf}  with  the  following 
properties; 

1)  fk{Vk)  C  14,  for  any  fc  G  where  T4  =  K-\k). 

2)  If  K(x)  =  k,  then  x  G  fk{Pa),  for  any  x  G  F{Pa). 

It  can  be  shown  that  such  a  function  always  exists.  This  choice  function  K 
determines  a  function  g  :  F{Pa)  — >•  Pa  given  by 

S(x)  = 

a 

It  is  not  difficult  to  show  that 

A„  =  n“ 

This  implies  that  if  x  G  Aq,  then  ^*(x)  G  Aa,  for  any  i  >  0. 

By  the  last  paragraph,  given  x  G  A^,  we  can  choose  the  sequence 
{ko,ki,..,)  by  the  relation  ki  =  K{g^{x)).  We  shall  call  this  the  basic  al¬ 
gorithm.  This  algorithm  always  works,  but  it  is  computationally  expensive. 
So  we  propose  another  algorithm,  computationally  feasible,  called  the  nearest 
point  algorithm. 


Nearest  point  algorithm 


The  nearest  point  algorithm  is  used  for  obtaining  a  sequence  {ko,ki,. ..)  repre¬ 
senting  a  vector  x  G  Aq.  It  can  be  seen  as  a  quantized  version  of  the  matching 
pursuits  algorithm. 

It  is  determined  by  the  choice  function  Kq  defined  by  the  property  that 
Ko{x)  is  the  code  vector  in  V  nearest  to  x.  We  denote  by  Vo.fc  the  set  KQ^{k) 
and  by  go  the  function 


9o{x)  = 


X  -  V/Co(X) 


This  choice  function  certainly  satisfies  Property  1  above,  but  Property  2  can 
fail.  It  is  not  difficult  to  see  that  Property  2  holds  if  and  only  if  Vo.fc  C  fkiPa), 
for  every  k  G  {1, . . . ,  g}. 

Let  A^ (X>)  =  n^^i5^”(Pa).  We  have  that  A°  (T>)  C  Aa(T>),  but  they  are 
not  necessarily  equal.  One  can  verify  that  A5^(X>)  =  Aq(X>)  if  and  only  if  the 
choice  function  Kq  satisfies  Property  2  above. 


Example  3.  Let  V  =  {(1, 0),  (0, 1),  (-1, 0)}.  For  any  0  <  a  <  1,  the  segment 
(0, ^),0  <  (5  <  1  is  not  contained  in  /2(Fq).  Therefore  go{0,S)  is  not  in  Pa, 
which  implies  that  this  segment  is  not  in  A®  (2?),  On  the  other  hand,  ifa>  |, 
Aa{V)  =  Pa^ 

We  have  also  observed  in  examples  that  if  A5^(X>)  contains  an  open  set  of 
]R^,  then  it  must  contain  the  convex  hull  of  some  code  vectors.  This  prompts 
the  following  question; 
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Question  2.  If,  for  some  0  <  a  <  1,  A5[(D)  contains  an  open  set  ofJR^,  then 
will  there  be  a  subdictionary  T>i  CV  such  that  A°  (7?)  D  Pa(Di)? 


§3.  Comparison  between  Alpha-Expansions  and  the  Decompose- 
Quantize  Procedure 

In  this  section  we  shall  compare  the  a-expansion  in  a  frame  with  the  2- 
step  procedure  of  first  decomposing  in  the  frame  and  then  quantizing  the 
coefficients  so  obtained  in  a  second  step.  We  shall  do  this  by  comparing  the 
rate-distortion  functions  of  each  scheme. 

Let  T  =  {ei,  €2, . . . ,  Cp]  be  a  frame  in  IR'^,  and  take  x  €  3R^  with  ||a:||  < 
M.  We  shall  assume  that  the  coefficients  (ai,  02, . . . ,  Op)  of  the  decomposition 
X  in  the  frame  T  satisfy  |ai|  <  CjM,  for  some  constant  Ci  that  depends  only 
on  T.  We  shall  consider  here  the  quantization  of  these  coefficients  by  binary 
expansions.  If  each  coefficient  is  represented  by  n  bits,  the  total  number  of 
bits  used  is  R  =  np,  and  the  maximum  square  error  per  coefficient  is  given 

by  ^CiM  .  If  we  multiply  this  by  p,  we  obtain  the  total  maximum 

square  distortion  D.  Therefore,  we  can  write  the  rate-distortion  relation 


Let  us  consider  now  the  a-expansion  procedure.  If  we  approximate  x  G 
Aa,  ||x||  <  M,  by  its  n-term  a-expansion  J,  the  maximum 

square  distortion  is  given  by  D  =  [C2Ma^]'^ ,  where  C2  is  a  constant  that 
depends  only  on  T.  The  number  of  bits  necessary  to  code  this  sequence  is 
R  =  nlog2(2p),  and  thus  we  have  the  rate-distortion  relation 


log2  (2p) 
21og2  (i) 


We  conclude  that  asymptotically,  the  a-expansion  is  better  than  the 
decompose-quantize  procedure  if  we  can  choose  a  such  that  Aq,  contains  an 
open  set  and 

log2(2p)^„ 

- - -nT<V  (1) 

(i) 


Example  4.  Let  F  =  {(^,  |),  (1,0),  (-^,  |)|  and  take  a  =  Then 
Aa  =  Fa  (which  in  this  case  is  a  hexagon)  and 


=  loggfi  <  3, 

which  implies  that  in  this  case  the  a-expansion  is  better  than  the  decompose- 
quantize  procedure. 


lQg2(2p) 
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«  10' 


10-* 

0 

Fig.  1.  The  8-dimensional  hypercube  as  codebook. 

Remark  2.  If  for  a  given  frame  T,  Aa  contains  an  open  set,  then  this  remains 
true  for  any  other  frame  T\  obtained  from  T  by  adding  some  more  vectors. 
Hence,  even  if  relation  (1)  does  not  hold  for  T,  it  will  hold  for  highly  redundant 
frames  containing  T. 

Remark  3.  By  Remark  1,  a  >  jJ—  implies  that  Therefore,  if  we 

consider  frames  satisfying 

(2P) 

P 

the  a-expansion  scheme  will  be  better  than  the  decompose-quantize  procedure. 

Experimental  results 

In  order  to  directly  compare  our  method  (the  nearest  point  algorithm)  with 
some  established  results,  we  look  at  some  examples  presented  in  [6]  (3.4.2,  pp. 
41-45).  To  this  end,  we  used  a  similar  source  and  the  same  codebook, 

A  zero-mean  gaussian  AR  source  with  correlation  coefficient  p  —  0.9  was 
used  to  provide  the  data  points.  Vectors  were  formed  by  blocks  of  N  samples. 
Rate  was  measured  as  the  first  order  entropy  of  the  index  stream  produced 
by  the  algorithm  -  similarly  to  [6]. 

In  Fig.  1,  we  used  the  vertices  of  the  8-dimensional  hypercube  as  the 
codebook.  In  this  case,  a  was  set  to  0.501  and  |[  Vk  1|  =  1.4142  (that  is  ^). 

As  can  be  seen  from  this  data,  our  method  does  give  some  performance 
benefits  on  low  bit-rates. 

Although  our  algorithm  can  be  very  expensive  in  terms  of  computational 
effort,  so  are  other  greedy  algorithms  like  matching  pursuits.  But  there  are 
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some  well  structured  codebooks  which  lend  themselves  to  fast  calculation  of 
the  steps  involved  -  like  the  one  used  in  this  example. 


§4.  Conclusions 

In  this  paper  we  have  further  developed  the  theory  of  a-expansions  and  applied 
it  in  the  context  of  quantized  frame  expansions. 

We  have  shown  that  a-expansions  perform  asymptotically  better,  in  a 
rate-distortion  sense,  than  the  decompose-quantize  method.  In  addition,  pre¬ 
liminary  experimental  results  indicate  that  this  method  also  compares  favor¬ 
ably  to  the  decompose-quantize  method  in  practical  cases.  This  was  verified 
by  direct  comparison  between  our  method  and  quantized  matching  pursuits 
from  [6]. 
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Cubic  Spline  Interpolation  on 
Nested  Polygon  Triangulations 

Oleg  Davydov,  Gunther  Niirnberger,  and  Prank  Zeilfelder 


Abstract.  We  develop  an  algorithm  for  constructing  Lagrange  and  Her- 
mite  interpolation  sets  for  spaces  of  cubic  -splines  on  general  classes  of 
triangulations  built  up  of  nested  polygons  whose  vertices  are  connected 
by  line  segments.  Additional  assumptions  on  the  triangulation  are  signifi¬ 
cantly  reduced  compared  to  the  special  class  given  in  [41.  Simultaneously, 
we  have  to  determine  the  dimension  of  these  spaces,  which  is  not  known 
in  general.  We  also  discuss  the  numerical  aspects  of  the  method. 


§1.  Introduction 

In  contreist  to  univariate  splines,  it  is  a  non-trivial  problem  to  construct  even 
one  single  set  of  interpolation  points  for  bivariate  spline  spaces.  Such  interpo¬ 
lation  sets  for  5J(A),  the  space  of  splines  of  degree  q  and  smoothness  r,  were 
constructed  for  crosscut-partitions  A  (see  the  survey  [9]  and  the  references 
therein).  For  general  triangulations  A,  interpolation  sets  were  constructed  for 
‘S'g(A),  g  >  4  in  [3]. 

The  case  q  =  Z  is  much  more  complicated  given  that  not  even  the  dimen¬ 
sion  of  5'3(A)  is  known  for  arbitrary  triangulations  A.  It  is  an  open  question 
whether  the  dimension  of  53  (A)  is  equal  to  Schumaker’s  lower  bound  [12].  The 
aim  of  this  paper  is  to  investigate  interpolation  by  Sl{A)  for  general  classes  of 
triangulations  A  consisting  of  nested  polygons  whose  vertices  are  connected 
by  line  segments.  Following  a  general  principle  of  locally  choosing  interpo¬ 
lation  points  for  53(A)  by  passing  from  triangle  to  triangle,  we  describe  an 
inductive  method  for  constructing  point  sets  that  admit  unique  Lagrange  (re¬ 
spectively  Hermite)  interpolation  by  53(A)  under  certain  assumptions  on  A. 
Moreover,  we  prove  that  the  dimension  of  these  spaces  is  equal  to  Schumaker’s 
lower  bound. 

In  this  way  we  obtain  a  class  of  triangulations  A  which  is  significamtly 
larger  than  the  special  class  described  in  [4].  Moreover,  the  methods  of  proof 
in  this  paper  are  different  from  those  in  [4].  It  is  important  to  note  that 
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triangulations  of  this  type  can  be  constructed  starting  from  any  given  points 
in  the  plane,  see  [11], 

The  numerical  examples  (with  up  to  100,000  interpolation  points)  show 
that  in  order  to  obtain  good  approximations,  it  is  desirable  to  subdivide  some 
of  the  triangles.  Our  method  of  constructing  interpolation  points  also  works 
for  these  modified  triangulations. 

We  note  that  our  interpolation  method  can  be  used  for  the  construction 
of  smooth  surfaces  without  involving  any  derivative  data.  For  scattered  data 
fitting,  the  needed  Lagrange  data  are  approximately  computed  by  local  meth¬ 
ods.  In  contrast  to  the  finite  element  methods  for  cubic  splines,  we  do  not 
need  to  subdivide  all  triangles  by  a  Clough-Tocher  split  or  use  derivatives. 


§2.  Preliminaries 

Let  A  be  a  regular  triangulation  of  a  simply  connected  polygonal  domain 
n  in  We  denote  by  ^^(A)  =  {s  e  :  sjr  €  Ha,  T  6  A}  the 

space  of  bivariate  splines  of  degree  3  and  smoothness  1  (with  respect  to  A). 
Here  Hs  =  span  {a;*' :  i',  fi  >  0,  u  fj,  <  3}  denotes  the  space  of  bivariate 
polynomials  of  total  degree  3. 

We  investigate  the  following  interpolation  problem.  Construct  a  set 
{zi, . . . ,  Ziv}  in  n,  where  N  =  dim 5^  —  (A),  such  that  for  each  function  /  G 
(7(fl),  a  unique  spline  s  G  53 (A)  exists  such  that  s{zi)  =  /(z*),  i  —  1, . . . ,  N. 
Such  a  set  {zi, . . . ,  zjv}  is  called  a  Lagrange  interpolation  set  for  53(A),  If  also 
partial  derivatives  of  /  are  involved,  then  we  speak  of  a  Hermite  interpolation 
set  for  ^3 (A). 

In  contrast  to  [4],  we  will  use  Bernstein- Bezier  techniques  [2,5].  Given  a 
spline  s  G  53(A),  we  consider  the  following  representation  of  the  polynomial 
pieces  p  =  six  G  II3  on  the  triangle  T  G  A  with  vertices  vi,V2,V3, 


p{x,y) 


L>+fl  +  (7  =  3 


(1) 


where  G  Hi,  /  =  1,2,3,  is  uniquely  defined  by  $/(t^fc)  =  6k, i,  k  —  1,2,3. 
This  representation  of  p  is  called  the  Bernstein- Bezier  representation  of  p, 
the  real  numbers  are  called  the  Bernstein-Bezier  coefficients  of  p,  and 

I  =  1,2,3,  are  the  barycentric  coordinates  (w.r.t.  T)  of  {x,y)  G  T. 

Definition  1.  A  set  A  C  {(i/,  p,  <7,  T)  :  v  +  p  +  a  =  3,  Tg  A}  is  called  an 
admissible  set  for  S3  (A)  if  for  every  choice  of  coefficients  aS^l,a ,  (z/,  /x,  <7,  T)  G  A, 
a  unique  spline  s  G  53(A)  exists  with  these  coefficients  in  the  above  Bernstein- 
Bezier  representation. 


The  above  Bernstein-Bezier  form  can  be  used  to  express  smoothness  con¬ 
ditions  of  polynomial  pieces  on  adjacent  triangles  Ti ,  T2  with  vertices  vi,V2,V3, 
respectively  vi,V2,i'4  (cf.  [2,5]), 


Theorem  2.  Let  s  be  a  piecewise  cubic  polynomial  function  deGned  on  Ti  U 
T2.  Tien  «  6  5^({Ti,T2})  iff  =  “So-  ‘' +  P  =  3,  and  n™,  = 

+  “LTi+i, 0^2(114)  +  t'  +  M  =  2. 
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For  later  use,  we  also  mention  here  the  following  relations  between  the 
Bernstein-Bezier  coefficients  of  a  cubic  polynomial  p  in  the  representation  (1) 
and  its  partial  derivatives  at  vi  in  direction  of  a  unit  vector  parallel  to  the 
edge  e  =  [ui,'y2],  denoted  by 


4ao = 4!i,o = ^("i) + 


U2II2 


,[rl 


K^i)  +  I^IK  -  «2||2  +  -  «2lll, 


“■1,2,0 

_  ^(^U,o  ~  ^ko,o) 

de  11^1-^2112 


(2) 


d'^piyi)  _  2  0  “  2a2j^o  ^3,0,0) 


[T] 


AT] 


ae2 


\\vi-V2\\l 


§3.  Main  Results 

In  this  section,  we  state  our  main  results  on  S'3(A),  where  A  consists  of 
nested  polygons  whose  vertices  are  connected  by  line  segments.  We  first  define 
this  class  of  triangulations.  Then,  we  determine  the  dimension  and  construct 
interpolation  sets  for  the  corresponding  spline  space.  Moreover,  we  show  that 
this  dimension  is  equal  to  Schumaker’s  lower  bound  [12].  Finally,  we  discuss 
a  property  of  A  which  is  essential  for  the  local  construction  of  interpolation 
points. 

First,  we  describe  triangulations  of  nested  polygons  and  decompose  the 
domain  into  finitely  many  subsets  needed  in  our  construction  of  interpolation 
points. 

Triangulations  of  nested  polygons.  We  consider  the  following  general  type 
of  triangulation  A.  Let  Pq,  -Pi,  •  •  • » -Pjk  be  a  sequence  of  closed  simple  polygonal 
lines,  and  let  be  the  closed  (not  necessarily  convex)  bounded  polygon  with 
boundary  P^.  Suppose  that  the  polygons  are  nested,  ie.,  C 

fj,  =  0, , . . ,  fc.  The  vertices  of  A  are  the  vertices  of  P^,  p  =  0, . . . ,  fc,  and 
one  vertex  inside  Pq.  The  edges  of  A  are  the  edges  of  P^,  p  =  0, . . . ,  fc,  and 
additional  line  segments  connecting  the  vertices  of  P^  with  the  vertices  of 
P^+i,  p  =  0, . . . ,  A:  —  1,  The  resulting  triangulation  A  of  0  :=  flk  does  not 
have  vertices  in  the  interior  of  p  =  0,...,k  —  l,  and  does  not  have 

edges  connecting  two  vertices  of  P^  other  than  the  edges  of  P^,  see  Figure  1. 
Decomposition  of  the  domain.  We  decompose  the  domain  0  into  finitely 
many  sets  Vq  C  Vi  C  •  ■  •  C  Vm  =  where  each  set  Vi,  is  the  union  of  closed 
triangles  of  A,  i  =  0, . . . ,  m.  Let  Vo  be  an  arbitrary  closed  triangle  of  A  in 
Uq.  We  define  the  sets  Vi  C  •  •  *  C  Kn  by  induction.  Assuming  Vi-i  is  defined, 
we  choose  a  vertex  Vi  of  A  such  that  there  exists  at  least  one  triangle  of  A 
with  vertex  Vi  and  a  common  edge  with  Vi-i.  Let  T^^i, . . .  >  1, 

be  all  such  triangles.  We  set  Vi  =  Vi-i  U  U  •  *  •  U  and  denote  by 

Ai  =  {rGA:TcVi}  the  subtriangulation  which  corresponds  to  the  set  Vi. 

The  vertices  Vi,  i  =  I, . . .  ,m,  are  chosen  as  follows.  After  choosing  Vq 
be  an  arbitrary  closed  triangle  of  A  in  Oq  ,  we  pass  through  the  vertices  of  Pq 
in  clockwise  order  by  applying  the  above  rule.  (It  is  clear  that  the  choice  of 
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Fig.  1.  Triangulation  of  nested  polygons, 

these  vertices  is  unique  after  fixing  the  first  vertex.)  Now,  we  assume  that  we 
have  passed  through  the  vertices  of  We  fix  a  vertex  of  that  is 

connected  with  at  least  two  vertices  of  P/i+i-  Then  w.r.t.  clockwise  order,  we 
choose  the  first  vertex  of  greater  than  which  is  connected  with  at  least 
two  vertices  of  Pn-i.  Then  we  pass  through  the  vertices  of  in  clockwise 
order  until  w~,  and  pass  through  the  vertices  of  P^  in  anticlockwise  order 
until  w'^  by  applying  the  above  rule.  (Here  w'^  denotes  the  vertex  next  to 
in  clockwise  order  and  w~  denotes  the  vertex  next  to  in  anticlockwise 
order.)  Finally,  we  choose  the  vertex  w^.  (It  is  clear  that  after  fixing  the 
choice  of  the  vertices  on  P^  is  unique.) 

The  construction  of  an  admissible  set  for  53(A)  and  the  choice  of  inter¬ 
polation  points  depend  on  the  following  properties  of  the  triangulation  A. 

Definition  3.  (1)  An  interior  edge  e  with  vertex  v  of  the  triangulation  A  is 
called  degenerate  at  v  if  the  edges  with  vertex  v  adjacent  to  e  lie  on  a  line. 
(2)  An  interior  vertex  v  of  A  is  called  singular  if  v  is  a  vertex  of  exactly  four 
edges  and  these  edges  lie  on  two  lines.  (3)  An  interior  vertex  v  of  A  on  the 
boundary  of  a  given  subtriangulation  A'  of  A  is  called  semi-singular  of  type 
1  w.r.t.  A'  if  exactly  one  edge  with  endpoint  v  is  not  contained  in  A'  and 
this  edge  is  degenerate  at  v.  (4)  An  interior  vertex  v  of  A  on  the  boundary 
of  a  given  subtriangulation  A'  of  A  is  called  semi-singular  of  type  2  w.r.t.  A' 
if  exactly  two  edges  with  endpoint  v  are  not  contained  in  A'  and  these  edges 
are  degenerate  at  v.  (5)  A  vertex  v  of  A  is  called  semi-singular  w.r.t.  A'  if  v 
satisfies  (3)  or  (4). 

In  the  following,  we  construct  an  admissible  set  and  interpolation  sets  for 
53(A),  where  A  is  a  nested-polygon  triangulation. 

Construction  of  an  admissible  set.  First,  we  choose  ^0  =  • 

=  3}  and  then,  proceeding  by  induction,  we  successively  add  admissi¬ 
ble  points  on  \  Vi_i,  i  =  1, . . . ,  m.  Assuming  that  an  admissible  set  Ai-i  on 

Vi-i  has  been  constructed,  we  choose  admissible  points  on  Vi  \  Vi_i  as  follows. 
By  the  above  decomposition  of  Vi  \  Vi_i  is  the  union  of  consecutive  trian¬ 
gles  Ti^i, . . .  ,Ti^ni  with  vertex  Vi  and  common  edges  with  Vi_i.  We  denote 
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the  consecutive  endpoints  of  these  edges  by  •  ■  •,Vi,ni7  and  the  piece- 

wise  polynomials  in  the  representation  (1)  on  Tij  by  pij  G  IIs,  j  =  1, . .  .n,, 
where  the  vertices  of  Ti^j  are  ordered  as  follows:  Vi,Vij,Vij+i.  Furthermore, 
we  denote  by  Cij  the  edges  j  =  0, . . .  ,nj. 

We  need  the  following  properties  of  the  subtriangulation  Aj  =  {T  G  A  : 
T  C  Vi}  at  the  vertices  Vi^o?  •  ♦  • ,  Vi^„. : 

(a)  Cij  is  non- degenerate  at  Vijy 

(b)  Vij  is  semi-singular  w.r.t.  Ai.  (This  latter  property  is  only  relevant  if 
Vij  lies  on  the  boundary  of  Ai,  i.e.,  for  j  G  {0,ni}.) 

For  j  G  {1, rii  —  1},  we  set  Cij  =  1  if  (a)  holds,  and  Cij  =  0  otherwise.  For 
j  G  {0,ni},  we  set  Cij  =  1  if  both  (a)  and  (b)  hold,  and  Cij  =  0  otherwise. 
Moreover,  we  set  Ci  =  Cij,  and  assume  Cj  <  3,  i  =  1, . . . ,  n. 

Now,  we  construct  the  following  admissible  points  on  Vi  \  Vi_i.  If  Cf  =  3, 
then  no  point  is  chosen.  If  Ci  =  2,  then  we  choose  (3,0, 0,Ti,i).  If  Ci  =  1,  then 
we  choose  (3,0,0, Ti,i)  and  (2,0, 1,2}^),  where  Cij  is  an  edge  with  Cij  =  0. 
If  a  =  0,  then  we  choose  (3, 0,0,  Ti^i),  (2, 0,  l,Ti^i)  and  (2,  l,0,Ti^i).  The 
admissible  set  Ai  on  Vi  is  obtained  by  adding  these  points  to  Ai-i. 

Construction  of  interpolation  sets.  We  choose  interpolation  points  in  Vq 
and  then  in  Vi  \  Vi_i, «  =  1, . . . ,  m,  successively.  In  the  first  step,  we  choose  10 
different  points  in  Vq  (respectively  10  Hermite  interpolation  conditions)  which 
admit  unique  Lagrange  interpolation  (respectively  Hermite  interpolation)  by 
the  space  IIs.  For  example,  for  Lagrange  interpolation,  we  may  choose  four 
parallel  line  segments  l^,  in  Vq  and  i/  different  points  on  each  l^^v  =  1, 2,3,4. 
Assuming  that  the  interpolation  points  in  V^-i  have  already  been  chosen,  we 
proceed  to  F*  \  Vi_i  as  follows. 

For  Lagrange  interpolation,  we  choose  the  following  points  in  F  \  F-i. 
If  Ci  =  3,  then  no  point  is  chosen.  If  Cj  =  2,  then  we  choose  u*.  If  Cj  =  1,  then 
we  choose  Vi  and  one  further  point  on  some  edge  ei^j  with  Ci^j  —  0.  If  c*  ==  0, 
then  we  choose  Vi  and  two  further  points  on  two  different  edges. 

For  Hermite  interpolation,  we  require  the  following  interpolation  condi¬ 
tions  for  s  G  «S'3(A)  at  the  vertex  Uj.  If  Cj  =  3,  then  no  interpolation  condition 
is  required  at  u,.  If  Cj  =  2,  then  we  require  s{vi)  =  f(vi).  If  Ci  =  1,  then 
we  require  s{vi)  =  f{vi)  and  -^-(vi)  =  where  is  some  edge 

with  Cij  =  0.  If  Ci  =  0,  then  we  require  s(ui)  =  f{vi),  |j('yi)  =  and 

=  d^M- 

By  the  above  construction,  we  obtain  a  set  of  points  for  Lagrange  inter¬ 
polation  respectively  a  set  of  Hermite  interpolation  conditions. 

Theorem  4.  Let  A  be  a  triangulation  of  nested  polygons.  If  for  all  i  G 
{l,...,m},  Ci  <  3  and  no  vertex  Vi  is  simultanously  semi-singular  (of  type 
2)  w.r.t  Ai  and  non-singular,  then  a  unique  spline  in  53(A)  exists  which 
satisfies  the  above  Lagrange  (respectively  Hermite)  interpolation  conditions. 
In  particular,  the  total  number  of  interpolation  conditions  is  equal  to  the 
dimension  of  53(A). 
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Proof:  First,  we  prove  that  the  set  constructed  above  is  an  admissible  set 
for  Sl{A).  To  this  end,  we  show  by  induction  that  Ai  is  an  admissible  set  for 
53(A)|Ai  =  {^lAi  :  5  G  53  (A)}.  This  is  clear  for  i  =  0.  Now,  we  assume  that 
Ai-i  is  an  admissible  set  for  5'3(A)|Ai_i ,  where  i  G  {1,  •  ■  •  ,rn},  and  consider 
Vi.  For  simplicity,  we  omit  here  the  index  i  for  Vi.,Vij,eij,pij,Tij  and  n^. 
It  follows  from  the  induction  hypothesis  and  Theorem  2  that  the  coefficients 
=  0,...,3-i/,  Z/  =  0,1,  oipj  G  Ha,  j  =  l,..,,n,  on  Tj,  are 
uniquely  determined.  Moreover,  if  Cij  =  1  for  some  j  G  {1, ...  ,n  —  1},  then 
it  follows  from  Theorem  2  that  the  coefficient  uniquely  determined. 

In  the  following,  we  show  that  if  Ci^o  =  1)  then  the  coefficient  q 
uniquely  determined.  Let  us  consider  the  case  where  vq  is  semi-singular  of 
type  2  w.r.t.  Aj.  (The  case  that  uq  is  semi-singular  of  type  1  w.r.t.  Aj  is 
analogous.)  We  denote  by  f/  G  A,  /  -  1, ...  ,3,  the  triangles  with  vertex  vq 
not  contained  in  Aj  in  anticlockwise  order,  and  by  ii  the  common  edge  of  Ti 
and  fi+i,  ^  =  1,2.  Since  fa  has  a  common  edge  with  Aj_i,  it  follows  from 

Theorem  2  that  the  coefficient  of  ps  G  Ila  on  fa  is  uniquely  determined. 
Moreover,  since  62  and  ei  are  degenerate  at  uq,  the  coefficients  of  pi  G  Ha 

on  Ti,  i!  =  1,2,  are  uniquely  determined.  Since  cq  is  non-degenerate  at  uq,  it 
follows  from  Theorem  2  that  the  coefficient  is  uniquely  determined.  We 
note  that  since  A  is  a  nested-polygon  triangulation,  at  least  two  triangles  with 
vertex  vq  not  contained  in  Aj  exist.  Therefore,  if  Cj,o  =  0,  then  the  coefficient 
yet  determined. 

'  '  •  \T  1 

Analogously  as  above,  it  can  be  shown  that  the  coefficient  ^2,0,1 
quely  determined  if  Cj^n  =  Otherwise,  this  coefficient  is  not  yet  determined. 

Now,  we  consider  the  vertex  v.  The  arguments  below  will  show  that  we 
may  assume  that  v  is  an  interior  point  of  A.  We  denote  by  T„+i  G  A,  ^  = 
1, . . .  ,r,  7-  >  3,  the  triangles  with  vertex  v  not  contained  in  Aj  in  anticlock¬ 
wise  order.  Moreover,  let  the  piecewise  polynomials  G  Ila,  I  =  1, . . .  ,r, 
on  Tn+i  in  the  representation  (1)  be  given  such  that  the  first  barycentric  co¬ 
ordinate  always  corresponds  to  v.  The  above  arguments  show  that  exactly 
Ci  <  3  coefficients  of  the  set  Ci  —  {a[,^3L,y_CT,o-  •  =  0, . . . ,  3  -  i/,  = 

2,3,  Z  =  1, . . .  ,n  +  r}  are  uniquely  determined.  On  the  other  hand,  we  con¬ 
struct  3  -  Ci  additional  admissible  points  from  Ci  on  Vf  \  Vl-i.  Now,  it  follows 
from  the  (7^-property  at  v  and  Theorem  2  that  all  coefficients  from  Ci  are 
uniquely  determined.  By  our  method  of  passing  through  the  vertices  of  A, 
V  is  not  semi-singular  of  type  1  w.r.t.  Aj.  In  particular,  if  v  =  for  some 
p  G  {0,  ...,/c}.  Moreover,  by  assumption  v  can  be  semi-singular  of  type  2 
w.r.t.  Ai  only  if  v  is  singular.  In  this  case,  we  have  r  —  3,  and  it  follows 
from  Theorem  3.3  in  [13]  that  the  coefficient  is  uniquely  determined. 

Otherwise,  if  r  >  4,  then  for  some  I  G  {l,...,r  —  1}  one  common  edge  of 
Tn+i  and  T^+i+i  is  non- degenerate  at  v,  and  we  can  also  proceed  with  our 
arguments. 

Since  all  relevant  differentiability  conditions  at  the  edges  with  endpoint 
V,  respectively  Vj,  were  involved,  the  above  shows  that  Ai  is  an  admissible  set 
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for  5'3(A)|Ai.  Thus,  the  set  Am  is  an  admissible  set  for  53(A). 

Therefore,  the  cardinality  of  Am  is  equal  to  the  dimension  of  53(A).  By 
construction,  it  is  evident  that  the  number  of  Lagrange  interpolation  points, 
respectively  the  number  of  Hermite  interpolation  conditions  coincides  with 
this  cardinality. 

By  an  inductive  argument,  it  follows  from  (2)  that  the  Hermite  inter¬ 
polation  conditions  at  v  determine  the  Bernstein-Bezier  coefficients  of  the 
admissible  points  chosen  on  Vi  \  Vi-i.  Analogously,  the  Lagrange  interpola¬ 
tion  conditions  uniquely  determine  the  interpolating  spline  on  the  edges  of 
Vi  \  Vi-i.  Therefore,  the  interpolating  spline  is  uniquely  determined  on  all  of 
Vi  \  Vi-i.  This  completes  the  proof  of  Theorem  4.  □ 

For  arbitrary  triangulations,  Schumaker  [12]  gave  the  following  lower 
bound  L(A)  for  the  dimension  of  53(A), 

L{A)  =  3Vb(A)  +  2yr(A)  +  a(A)  -f  1.  (3) 

Here,  Vb(A)  is  the  number  of  boundary  vertices  of  A,  1^/(A)  is  the  number 
of  interior  vertices  of  A  and  ct{A)  is  the  number  of  singular  vertices  of  A.  For 
bounds  on  the  dimension  of  bivariate  spline  spaces  see  also  Manni  [6]. 

Theorem  5.  If  a  triangulation  A  of  nested  polygons  satisfies  the  hypotheses 
of  Theorem  4,  then  the  dimension  of  S^(A)  is  equal  to  L(A). 

Proof:  We  have  to  show  that  the  cardinality  of  Am  is  equal  to  L(A).  We 
prove  this  by  induction.  We  set  5(Ao)  =  0  and  for  i  e  {1, . . .  m},  we  denote 
by  S{Ai)  the  set  of  boundary  vertices  w  of  Aj  such  that  w  =  vi^o  and  c/,o  =  1 
(respectively  w  =  vi^m  and  c;,n,  =  1)  for  some  I  €  z}.  Moreover,  let 

Oi  be  the  cardinality  of  5(At)  and  Ui  be  the  cardinality  of  Ai.  We  will  show 
that 

L(Ai)  =  Ui  +  dj,  z  =  0,  ...,m.  (4) 

This  is  evident  for  i  =  0.  We  assume  that  L{Ai-i)  =  aj_i  -f  for  some  i  G 
{1, . . .  ,m}  and  consider  V.  We  have  VsiAi)  =  T/B(Ai_i)  -n^  +  2,  Vj{Ai)  = 
V7(Ai_i)  -\-ni-l,  a{Ai)  —  a{Ai-i)  7t,  where  7^  is  the  number  of  singular 
vertices  from  the  set  {vij  :  j  =  1, . . . ,  n*  -  1}.  Since  a*  =  a^-i  -h  3  -  c^, 
it  follows  from  the  induction  hypothesis  and  some  elementary  computations 
that 

L{Ai)  =  Oi  +  ^i-i  +  Ci  -{-ji-m  +  l. 

By  our  method  of  passing  through  the  vertices  of  A,  it  is  evident  that  if 
Vi,o  =  Vi-i  G  S{Ai),  then  Vi^o  ^  5(Ai_i).  In  the  following,  we  show  that  if 
Vi^m  e  5(Ai),  then  Vi^m  ^  5(Ai_i).  First,  let  us  assume  that  Vi^m  =  vi^o 
for  some  /  G  {1, . . .  ,i  —  1}.  If  Vi^m  is  semi-singular  of  type  2  w.r.t.  Aj,  then 
at  least  three  edges  of  A  not  contained  in  A;  are  attached  to  vi^q.  Hence, 
Ci,o  =  0.  If  Vi^m  is  semi-singular  of  type  1  w.r.t.  A^,  then  the  edge  Ci^m  is 
non-degenerate  at  Vi^m,  since  =  1.  Therefore,  is  not  semi-singular  of 
type  2  w.r.t.  A/.  Again,  c/,o  =  0  holds.  The  remaining  case  Vi^m  =  Vi-i^m-ij 
where  =  1,  follows  by  the  same  arguments. 
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Now,  we  show  for  j  6  {1, . . .  ,ni  -  1}  that  every  non-singular  vertex  Vi^j 
such  that  Cij  is  degenerate  at  Vij  lies  in  «S(A,_i).  First,  we  consider  the  case 
j  =  1.  Set  Vi^  —  Vi^i  and  let  eo  be  the  edge  that  connects  Vi^o  and  Vi^i.  We 
have  to  consider  two  cases. 

Case  1.  (The  vertices  and  Vi-2  are  connected  by  an  edge  e.)  If  eo  is 
non-degenerate  at  Vi^i  then  =  1.  (In  this  case  Vi^i  is  semi-singular  of 

type  1  w.r.t.  Ai_i.)  Otherwise,  since  Vi^i  is  non-singular,  e  is  non-degenerate 
at  Vi^i.  Thus,  Ci_2,Tii_2  =  1-  semi-singular  of  type  2  w.r.t. 

Ai_2.)  We  note  that  Vi^i  is  not  semi-singular  w.r.t.  Ajj+i,  since  at  least  three 
edges  of  A  not  contained  in  Ajj+i  are  attached  to 

Case  2.  (The  vertices  and  Ui_2  are  not  connected  by  an  edge.)  If  eo  is 
non-degenerate  at  Vi^i  then  we  also  have  Ci_ =  1.  (In  this  case  Vi^i  is 
semi-singular  of  type  1  w.r.t.  Aj-i.)  We  note  that  Ui,i  is  not  semi-singular 
w.r.t.  Aj^+i,  since  eo  is  non- degenerate  at  Vi^i.  Otherwise,  let  e  be  the  edge 
that  connects  with  ifti+i.  Since  is  non-singular,  e  is  non-degenerate 
at  Vi^i.  Thus,  Cii+1,0  =  1-  (In  this  case  Vi^i  is  semi-singular  of  type  2  w.r.t. 
Ajj+i.)  We  note  that  in  this  case  Vi^i  is  semi-singular  of  type  1  w.r.t.  A,_i, 
but  =  0. 

Now,  we  consider  the  remaining  case  j  G  {2, ...  ,71,  —  1}.  Set  Vi^  =  Vij 
and  let  e  be  the  edge  that  connects  Vij  with  Since  Vij  is  non-singular, 

it  follows  that  Vij  is  not  semi-singular  of  type  2  w.r.t.  to  Ai.,  Therefore,  e  is 
non-degenerate  at  Vij.  Hence,  Ci. =  1*  (In  this  case  Vij  is  semi-singular 
of  type  1  w.r.t.  Ai^.+i.)  We  note  that  in  the  case  j  G  {2,  ...,ni  —  1},  by 
our  method  of  passing  through  the  vertices  of  A,  the  value  is  not 

influenced  by  the  geometrical  properties  of  A  at  Vij. 

The  above  proof  now  implies  ai  =  ai-i  +  Cj  +  7i  —  rij  +  1,  and  therefore, 
(4)  holds.  Since  am  =  0,  we  get  L(A)  =  Om-  This  proves  the  theorem.  □ 

In  Theorem  4  we  assume  that  for  all  i  G  {!,..., m},  no  vertex  Vi  is 
simultaneously  semi-singular  (of  type  2)  w.r.t.  Aj  and  non-singular.  In  the 
following,  we  show  that  this  assumption  is  essential  for  the  local  construction 
of  interpolation  points. 


Example  6.  Let  v  =  vi  =  (0,0),  v^  =  vq  =  vi^q  =  (7?0):  7  <  — 

(r, mr),  r  <  0,  m  >  0,  V2  —  1^1,2  =  (0,6),  6  <  0,  and  set  U3  =  (o;,0),  a  > 
0,  V4  =  (0,  ^),  P  >  0.  Let  v  be  connected  with  V3  and  V4  and  vi-i  be  connected 
with  vi,  /  =  1, . . . ,  5.  Then  v  is  simultaneously  semi-singular  (of  type  2)  w.r.t. 
Ai  and  non-singular.  Furthermore,  we  denote  by  Ti  the  triangle  with  vertices 
v,vi-i,vi  and  by  pi  G  Ha  the  polynomial  pieces  on  T;,  /  =  1, . . .  5,  in  the  rep¬ 
resentation  (1).  We  consider  the  set  C2  =  c7'  =  0,...,3-i/,  1/  = 

1, . . . ,  3,  Z  =  1, . . . ,  5}.  For  C^-splines,  it  follows  from  Theorem  3.3  in  [13]  that 
each  subset  of  C2  that  uniquely  determines  all  coefficients  of  C2  has  cardinality 
8  and  contains  the  coefficients  a^i^Q  2)  ^  =  3,4.  By  the  proof  of  Theorem  4, 
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the  coefficients  ^  -  0,1,2,  and  cr  =  1,2,  are  uniquely 

determined.  If  Cj^i  is  non-degenerate  at  ui,  then  in  addition  a^^oji  is  uniquely 
determined.  Otherwise,  this  coefficient  is  not  determined.  Hence,  if  Ci^i  is  non¬ 
degenerate  at  ui,  then  we  have  to  choose  exactly  one  additional  coefficient  to 
determine  all  coefficients  of  C2,  and  otherwise,  we  have  to  choose  exactly  two 
additional  coefficients.  We  claim  that  in  the  latter  case  every  choice  of  exactly 
two  additional  coefficients  from  the  set 
determine  all  coefficients  of  C2. 

Proof;  Suppose  that  is  degenerate  at  v\  and  choose,  for  example,  J  q 
and  For  simplicity,  we  set  <21  =  ^3  —  ^4  = 

as  =  a^^J  0,  06  =  and  assume  that  the  remaining  coefficients  in 

C2  are  zero.  By  Theorem  2, 


T  —  ^  Oi  0  u 

03  =  ( - /?  -I-  1)02,  04  =  (1 - )a3,  01  =  (1  —  — )02  -f  — 03, 

771X7  7  P  p 

_  8.  6  o:.  „  ,,  .,m  I.  ^ .  T 

06  =  (1  -  ^)05  4-  ^04,  05  =  (1  -  — )a2,  0  =  ((-t)(-^  -I-  -f-  -I-  — flQ. 

Eliminating  aj,  j  E  {3,4,5},  yields  oi  =  (1  -}-  6^)02,  oq  ~  (1  ~  ^)(1  -{- 
5~^)o2.  By  some  elementary  computations,  we  obtain  for  the  determinant 
D  o?  the  corresponding  system 


and  it  is  easy  to  verify  that  H  =  0  iff  Ci^i  is  degenerate  at  vi.  Other  choices  of 
exactly  two  additional  coefficients  from  the  set  (03^0,0 >  ^2^1,0’ 
can  be  examined  in  the  same  way,  which  proves  our  claim.  □ 

Note  that  if  ei,i  is  non-degenerate  at  vi,  then  every  choice  of  exactly  one 
additional  coefficient  in  the  set  {usyolo’  ^2^0!!}  determines  all  coefficients 

in  C2. 

We  finally  discuss  some  numerical  aspects  of  our  scheme.  A  method  for 
constructing  nested  polygon  triangulations  A  of  given  points  in  the  plane 
which  satisfy  the  conditions  of  Theorem  4  was  developed  in  [11].  Our  numer¬ 
ical  tests  show  that  in  order  to  obtain  good  approximations,  it  is  necessary  to 
subdivide  some  of  the  triangles  (for  details  see  [10,11]).  Meanwhile,  we  have 
computed  such  examples  with  a  high  number  of  interpolation  conditions.  We 
only  mention  here  that,  for  example,  Lagrange  respectively  Hermite  interpo¬ 
lation  of  Franke’s  test  function  by  cubic  C^-splines  with  118,822  interpolation 
conditions  yields  an  error  of  4.66902  *  10“®  in  the  uniform  norm. 
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Stable  Local  Nodal  Bases  for 
Bivariate  Polynomial  Splines 


Oleg  Davydov  and  Larry  L.  Schumaker 


Abstract.  We  give  a  stable  construction  of  local  nodal  bases  for  spaces 
of  bivariate  polynomial  splines  of  degree  d  >  5  defined  on  arbitrary  tri¬ 
angulations.  The  bases  given  here  differ  from  recently  constructed  locally 
linearly  independent  bases,  and  in  fact  we  show  that  stability  and  local 
linear  independence  cannot  be  achieved  simultaneously. 

§1.  Introduction 

Given  a  regular  triangulation  A,  let 

5J(A)  :=  {s  e  :  s\t  ^Vd  for  all  triangles  T  e  A}, 

where  Vd  is  the  space  of  polynomials  of  degree  d,  and  0  is  the  union  of  the 
triangles  in  A.  In  this  paper  we  focus  on  the  case  r  =  1  and  d  >  5.  The  main 
result  of  the  paper  is  a  construction  of  a  basis  B  := 
the  following  properties: 

PI)  The  basis  B  is  local  in  the  sense  that  for  each  1  <  i  <  n,  the  support  of 
Bi  is  contained  in  star(ni)  (see  the  end  of  this  section)  for  some  vertex  Vi^ 
P2)  The  set  B  is  stable  in  the  sense  that  there  exist  constants  Ki  and  K2 
dependent  only  on  the  smallest  angle  9^  in  A  such  that 


n 

1=1 

for  all  choices  of  the  coefficient  vector  c  =  (ci, . , . ,  Cn). 

Bases  for  «S^(A)  satisfying  property  PI  were  constructed  in  [14]  using  nodal 
techniques,  but  they  fail  to  satisfy  property  P2  for  triangulations  with  near 
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singular  vertices  or  near  degenerate  edges,  even  if  the  smallest  angle  in  the 
triangulation  is  controlled. 

For  convenience,  we  recall  the  definitions  of  some  of  the  terminology  used 
above.  Suppose  u  is  a  vertex  of  a  triangulation  which  is  connected  to  ,  i;2 ,  V3 
in  counter-clockwise  order.  Then  the  edge  e  ;=  (^,^2)  is  said  to  be  near¬ 
degenerate  at  V  (degenerate  at  u)  provided  that  the  edges  (u,ui)  and  (^,^3) 
are  near-collinear  (collinear).  The  vertex  v  is  called  near-singular  (singular) 
if  there  are  exactly  four  near- degenerate  (degenerate)  edges  attached  to  it. 
Given  a  vertex  u  of  A,  star(u)  =  star^(u)  is  the  set  of  triangles  sharing  u, 
and  star^(u)  is  defined  recursively  as  the  union  of  the  stars  of  the  vertices  of 
star^“^(u). 


§2.  Nodal  Determining  Sets  and  Nodal  Bases 

Suppose  s  is  a  spline  in  tSj(A),  and  that  u  is  a  point  in  fl.  In  this  paper 
we  are  interested  in  certain  linear  functionals  defined  on  S^{A)  in  terms  of 
values  and  derivatives  of  s  at  points  v  in  H.  Such  functionals  are  called  nodal 
functionals.  There  are  three  types  of  nodal  functionals  of  interest  here: 

1)  the  value  s{v), 

2)  the  directional  derivative  Dlf}s{v),  where  w  is  a  given  vector  and  m  is  a 
positive  integer, 

3)  the  mixed  derivative  at  a  vertex  v  of  A,  where  wi  and  W2 

are  two  noncollinear  vectors  which  point  into  a  common  triangle  T  of  A. 

Definition  1.  A  collection  M  :=  {AJ-Lj  of  nodal  functionals  is  called  a 
minimal  nodal  determining  set  for  5](A)  provided  they  form  a  basis  for  the 
dual  space  (5] (A))*.  If  M  is  such  a  set,  then  there  exist  unique  splines 
B  in  «S](A)  such  that 

XiBj  =  )  iij  —  1, . .  •  ,7i.  (2) 


We  call  B  a  nodal  basis  for  5j(A). 

In  this  paper  we  will  concentrate  on  nodal  functionals  which  involve 
derivatives  De  along  edges  e  :=  {vi,V2}  of  the  triangulation  A,  or  perpen¬ 
dicular  to  such  edges.  Denoting  the  Cartesian  coordinates  of  a  point  v  by 
we  see  that  the  derivative  along  the  edge  e  is  given  by 

(uf  -  vf)Da,s(v)  +  {v^  -  vl)Dys{v) 
while  the  derivative  perpendicular  to  the  edge  e  is  given  by 


De±s{v) 


-  v\)Da:s{v)  -  (uf  -  -u^)Dys(u) 


Note  that 


±s(u). 


~~  ^{V2,Vl) 
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§3.  Smoothness  Conditions  Between  Polynomial  Pieces 

It  is  well-known  how  to  describe  smoothness  between  polynomials  defined  on 
adjoining  triangles  in  terms  of  the  Bernstein-Bezier  coefficients  of  the  two 
polynomials.  Here  we  need  similar  conditions  in  terms  of  nodal  information. 
Suppose  T  =  {vi,V2yV^)  and  T  =  {vi,V2,vs}  are  two  adjacent  triangles  with  a 
common  edge  e  =  We  set  6i  =  ZU3V1V2,  O2  =  /.vsV2Vi,  61  =  /.v:iViV2, 

62  —  IV3V2V1.  Suppose 


<  1;''°  <  •  •  •  Ujf  5  <  V2 

(3) 

Vi  <  vl'^  <  <t>2 

are  given  points  lying  in  the  interior  of  the  edge  e. 

Lemma  2,  Letj),p  be  polynomials  of  degree  d  >  5  defined  on  adjoining 
triangles  T  and  T  as  above.  Then  p  and  p  join  together  with  smoothness 
across  the  edge  e  :=  (^1,^2)  if  and  only  if  the  difference  g  =  p  —  p  satisfies 

g{vi)  =  Deg{vi)  =  D^±g{vi)  =  Dlg(vi)  =  0, ,  z  =  1, 2,  (4) 

g{v^'^)  =  0,  i  =  l,...,d-5, 

1 

De-^sK’')  =  0,  i  =  l,...,d-A, 

and 

where  Ui  sin^j,  Ui  :=  sin^f,  di  :=  sin(^i  -1-  0i),  i  =  1,2. 

Proof:  We  follow  the  method  of  proof  of  the  main  result  in  [14].  Concerning 
necessity,  we  first  observe  that  if  p  and  p  join  with  continuity  across  e, 
then 

g(v)  =  Dy}g(v)  =  0,  for  all  v  e  e,  (7) 

where  w  is  any  unit  vector  noncollinear  with  the  edge  e.  This  implies  (5) 
and  the  conditions  on  g,  D^g  and  D^xg  in  (4).  The  conditions  on  the  second 
derivatives  are  easily  obtained  by  differentiating  the  identities  (7)  along  the 
edge  e  and  using  the  fact  that 

^iD{vuV2}PM  =  ^iD{y,,y^)p{vi)  +  aiD^^^^^^)p{vi), 
^2D/^y^^y,)p{V2)  =  ^2D(y^^y^)p{V2)  +  (72D(^y^^ii^)p{V2). 

To  prove  sufficiency,  suppose  that  p  and  p  satisfy  (4)-(6).  Then  the 
univariate  polynomial  ^le  is  of  degree  at  most  d  and  satisfies  d  -f-  1  homoge¬ 
neous  Hermite  interpolation  conditions  on  e.  Therefore  g(v)  =  0  for  u  G  e. 
This  shows  that  p  and  p  join  continuously.  We  now  consider  the  cross¬ 
derivative  q  :=  which  is  a  univariate  polynomial  of  degree  at  most 
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d  —  1.  By  (4)-(5),  q  has  d  —  2  zeros  vi^v\'^ ^ . . .  ,v^^\^V2  on  e.  Moreover,  by  (6), 
Deq{vi)  =  Deq{v2)  ~  0,  as  is  easy  to  check  by  expressing  DeD^±p{vi)  as  a  lin¬ 
ear  combination  of  D‘^p{vi)  and  DeD(^y^^y^)p{vi)  and  expressing  DeD^±p{vi) 
in  terms  of  Dlp{vi)  and  DeD^y^^y^)p{vi),  and  similarly  for  V2.  Therefore, 
gi  =  0,  and  we  have  shown  that  p  and  p  join  with  -smoothness.  □ 

For  a  different  set  of  nodal  smoothness  conditions,  see  [5]. 


§4.  Construction  of  a  Stable  Local  Nodal  Basis  for  «5j(A) 

In  this  section  we  begin  by  defining  a  spanning  set  A/a  of  nodal  functionals 
for  («S^(A))*.  Then  we  choose  an  appropriate  linearly  independent  subset 
M  which  forms  a  basis  for  («Sj(A))’^.  This  will  involve  analysing  the  linear 
dependencies  between  elements  of  A/a  the  smoothness  conditions).  The 
corresponding  nodal  basis  determined  by  the  duality  conditions  (2)  will  be  the 
desired  stable  local  basis  for  <Sj(A).  Given  a  triangle  T  (vi,V2,V3},  let 

Vijk-= - ^ - ,  i-]-j  +  k  =  d. 

Given  an  edge  e  of  A,  let  and  v^'^  be  the  points  defined  in  (3).  We  define 

C'^  :=  [Xjjf.s  =  :  i  +  j  +  k  =  d,  2<i,j,k}, 

S{e):={Xfs  =  s{vf):  i  =  l,...,d-b} 

U{Xfs  =  \e\D,.s{v^'^):  t  =  1, . . . ,  d  -  4}, 
where  |e|  denotes  the  length  of  e. 

Given  a  vertex  u  in  A,  suppose  the  vertices  connected  to  v  are  Ui , . . . ,  in 
counterclockwise  order  (with  Vi  a  boundary  vertex  if  v  lies  on  the  boundary), 
and  let  =  {v,Vi,Vi^i),  =  {v,Vi),  Si  =  Ze^ei+i,  where  if  v  is  an  interior 
vertex,  we  identify  vi^n  =  ve,  Denote  by  |star(u)|  the  diameter  of 

star('i;).  Let 

:=  =  |star(tj)P+^DiDjs(u)  :  0  <  z+i  <  1} 

Tl^iv)  :=  {AVp6  =  i  =  1,. . .  ,n} 


U 


sin  Oi 


|star  (z;)pDe,De.^js(z;)  :  z  =  1, . . .  ,n} 


if  V  is  an  interior  vertex,  and 

^2(V)  :=  {A-,p5  =  =  i  =  2,...,n-l} 

Li  {AVpS  =  |star  (z;)|^De.s(u)  ;  z  =  l,n} 

U  {A-,^5  =  ^|star(z;)pZ)e,De,+,5(zj)  :  z  =  1,...  ,n  -  1} 
if  z;  is  a  boundary  vertex.  Let 

Va  :=  U  U  U  [l>i(f)U7i2(t>)]. 

TeA  eCA  VGA 

The  Markov  inequality  implies  that  for  all  s  €  <5] (A)  and  all  A  G  A/a, 

|As|  <  a:||s||oc, 

for  some  constant  depending  only  on  d  and  the  smallest  angle  ^a  in  A. 


(8) 
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Lemma  3.  The  set  A/a  is  a  spanning  set  for  (Sj(A))*.  Moreover,  the  only 
linear  dependencies  between  elements  of  A/a  are  given  by 

AU  +  Ar-i,„  =  sin(fli  +  0i_,)AVp  (9) 

for  every  vertex  v  and  every  interior  edge  ei  attached  to  v. 

Proof:  Let  s  G  «S](A).  If  As  =  0  for  all  A  G  A/a,  then  on  each  triangle  T  G  A 
there  are  exactly  homogeneous  Hermite  interpolation  conditions  on  s, 

and  it  is  easy  to  see  that  they  force  s  to  be  zero.  It  follows  that  A/a  is  a 
spanning  set  for  («Sj(A))*.  The  second  statement  follows  immediately  from 
Lemma  2.  □ 

Algorithm  4.  (Construction  of  a  stable  local  nodal  basis  for  «S^(A).j  Let 
be  the  set  of  splines  determined  by  the  duality  conditions  (2)  corre¬ 
sponding  to  the  following  set  M  :=  {Ai}7_i  of  nodal  functionals: 

1)  For  each  triangle  T,  choose  the  {^2"^)  nodal  functionals  in  C^. 

2)  For  each  edge  e  =  {vi,V2),  choose  the  2d  —  9  nodal  functionals  A®’^  and 

inS(e). 

3)  For  each  vertex  v,  choose  the  three  nodal  functionals  in  T>i(v). 

4)  For  each  vertex  v,  choose  the  following  nodal  functionals  in  7l2(v): 

a)  one  of  the  functionals  A^  ^  corresponding  to  the  Erst  mixed  derivative 
at  V,  and 

b)  all  functionals  A^p  corresponding  to  the  pure  second  derivatives  at 
V,  with  one  exception:  if  v  is  a  nonsingular  interior  vertex,  the  func¬ 
tional  Xf^  p  is  omitted,  where  io  is  chosen  such  that 

|sin(0io  +  >  |sin(0i  +  for  aJi  i  =  1, . . . ,  n.  (10) 

Theorem  5.  The  set  M  of  Algorithm  4  is  a  minimal  nodal  determining  set 
for  <5j(A),  and  the  nodal  basis  {Si, . . . ,  Bn]  for  «S](A)  deEned  in  (2)  is  local 
and  stable,  Le.,  it  satisEes  both  conditions  PI  and  P2. 

Proof;  The  fact  that  Ad  is  a  basis  for  (<Sj(A))*  follows  easily  from  Lemma  3. 
To  construct  a  typical  basis  spline  Bj,  we  set  XiBj  =  Si j  for  alH  =  1, . . .  ,n. 
Then  the  remaining  nodal  values  XBj ,  X  G  A/a  \  Ad  are  computed  from  the 
smoothness  conditions  (9).  It  is  easy  to  see  that  the  support  of  the  resulting 
spline  is  at  most  the  star  of  a  vertex.  This  shows  that  PI  is  satisfied. 

It  remains  to  show  that  the  Bj  form  a  stable  basis.  This  follows  from  (8) 
by  a  standard  argument  [12],  provided  we  can  show  that 

II-SjIIoo  ^  N,  1  ^  ^  u.,  (11) 

where  K  is  a  constant  depending  only  on  d  and  the  smallest  angle  ^a  in  A. 
This  clearly  holds  if 


\XBj\  <K,  for  all  A  G  A^a 
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for  a  similar  constant  K.  By  construction,  |ABj|  <  1  for  all  A  G  M.  Since 
•A/a  \  C  let  us  take  an  arbitrary  vertex  of  A  and  notice  that 

if  A  G  7^2  5  then  XBj  can  be  nonzero  only  if  the  corresponding  Aj  lies  in 

7^2(v).  Therefore,  it  will  be  sufficient  to  show  that  |ABj|  <  K  for  all  j  such 
that  Aj  G  'Jl2{y)  and  all  Xell2{v)\M.  We  distinguish  four  cases. 

Case  1:  (u  is  a  boundary  vertex.)  In  this  case,  7^2 ('^)  \  M  =  {Ay„,  : 
i  =  l,...,n  —  1,  z/zi},  where  X^^^^  is  the  functional  included  in  M  in  step 
4a)  of  Algorithm  4,  Without  loss  of  generality  we  assume  that  n  =  1.  For 
any  s  G  Sj(A),  (9)  implies 

-Km^  +  ^2Xl,S 
=  AJ  -  (r2A5  pS  +  o-aAgpS 


n  — 1 


A" 

“^n  — l,m 


S  = 


i=2 


where  we  set 

Ui  :=  sm{9i  +  ^i-i). 

If  we  take  s  to  be  the  basis  spline  Bj  corresponding  to  a  Aj  G  72-2  (^^),  then 
all  but  one  of  the  values  on  the  right-hand  side  of  the  expression  for  AV^Bj 
vanishes,  and  thus 


\^i,Tn^j\  —  {AjBjl  —  1,  2  —  2, ...  ,n  1, 

which  proves  the  assertion. 

Case  2:  (u  is  an  interior  vertex  with  n  7^  4.)  In  this  case,  72-2  (i’)  \  Ad  = 

{AV^  :  i  =  i  ^  ii}U  {AV^,p}.  For  AV^s,  i  =  i  ^  ii,  the 

same  calculation  as  in  Case  1  applies:  we  start  from  and  calculate  AV^s 

consecutively  counterclockwise  until  AJ^_j  ,^s,  and  then  also  clockwise  until 
For  pS,  we  have  by  (9), 

=  +  (12) 

Therefore,  our  claim  will  be  established  if  we  show  that 

K-*|  =  |sm-H»i„  +  «(o-i)l<1^3  ifn/4,  (13) 

where  K3  is  a  constant  dependent  only  on  $a-  This  is  obvious  for  n  =  3. 
Assuming  n  >  5,  we  have  \9i  -h  ^2  +  ^3  +  ^4  —  27r|  >  ^a-  Hence, 

\9io  +  9i^^i  -  7r|  >  max{|^i  -f  6>2  -  7r|,  |^3  -f  6^4  -  7r|}  >  9^/2, 


and  (13)  follows. 


Stable  Local  Nodal  Bases 


177 


Case  3:  (v  is  a  singular  vertex.)  In  this  case,  7l2{v)\M.  =  i  =  2, 3,4} 

(where  we  assume  for  simplicity  that  ii  =  1).  Since  o-j  =  •  •  •  =  0-4  =  0  for  a 


singular  vertex,  (9) 

now  reduces  to 

i  =  1,2,3, 4. 

Therefore, 

i  =  2,3, 4, 

and  the  assertion  follows. 


Case  4;  (u  is  a  nonsingular  interior  vertex  with  n  =  4.)  We  proceed  as  in 
Case  2,  but  calculate  A^^  pS  differently.  At  first  glance  it  may  seem  that  (10) 
does  not  guarantee  stability  since  |crjo|  may  be  arbitrary  small  (in  the  case 
of  near-singularity),  while  A}^  pS  is  to  be  computed  from  (12).  However,  the 
complete  system  of  equations  (9)  for  7l2(v)  is 

ai\l^s  =  Xl^s-^XU,mS,  z  =  l,2,3,4. 

Taking  the  sum  with  alternating  signs,  we  get 

i=i 

and  hence 

for  every  s  =  Bj,  with  Xj  G  7^2 (^^)-  This  completes  the  proof  of  (11),  and  the 
theorem  has  been  established.  □ 

§5.  Stability  vs.  LLI 

We  recall  (cf.  [2,4, 6, 8, 9])  that  a  set  B  of  basis  splines  in  «S^(A)  is  called  locally 
linearly  independent  (LLI)  provided  that  for  every  T  G  A,  the  splines  [Bi  :  i  G 
Ex}  are  linearly  independent  on  T,  where 

Ex  :=  {«  :  TCsuppHj}.  (14) 

A  star-supported  LLI  nodal  basis  was  constructed  for  <Sj(A)  in  [4].  We  now 
establish  the  following  surprising  result. 

Theorem  6,  For  d  >  5,  it  is  impossible  to  construct  a  basis  for  <Sj(A)  which 
satisfies  both  conditions  P2  and  (14)  simultaneously. 

Proof:  Suppose  {Hi, . . . ,  Bn}  is  a  locally  linearly  independent  basis  for  «S}(A) 
on  a  triangulation  A  which  contains  an  interior  near-singular  vertex.  Suppose 
V  is  connected  to  ui , . . . ,  U4  in  counterclockwise  order,  and  let  be  the  edge 
(u,Ui),  Ti  the  triangle  (ui,ui+i,u),  and  6i  the  angle  between  and  Ci+i- 
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Suppose  that  none  of  ej  is  degenerate  at  v.  For  each  1  <  i  <  4,  let  Sj  be  the 
unique  spline  in  (A)  such  that 


\sj  =  0,  for  all  A  G  A/a  \  7^2  (^) 


Clearly, 

suppsj  =  Tj-i  UTj  UTj+i, 
and  we  can  write  (see  [2,9]) 

ieij 


where  Ij  :=  {i  :  suppJ5i  C  supps^}.  We  now  consider  the  spline 


S  =  -Si  +  52  -  53  +  54  =  -  ^  Ci 

i£h 


[1] 


i€h 


[2] 


Bi-J2 

ieh 


,[3] 


i€/4 


=  ^  OiBi. 

iehuhuhuh 


Using  the  smoothness  conditions  (9),  it  is  easy  to  see  that 

AS'.pi  =  0,  A;',^5  =  (-1)\  2  =  1, 2, 3, 4, 

As  =  0,  for  all  A  G  A/a  \ 

and  thus  ||s||oo  <  7^4,  where  K4  depends  only  on  d.  If  the  basis  {5i, . . . ,  5„} 
satisfies  P2,  we  get 

IHloo  <  iff^PIloo  < 

Moreover,  since  ^2,mBi  i=-  0  only  if  Ti  U  r2  U  T3  C  supp  5i,  we  have 


1  =  A^^^5  =  <  #/2||a||oomax|A5„,J5i|, 


with  I2  :=  {2  :  supp^i  =  Ti  U  T2  U  T3}.  Clearly,  #/2  <  3(‘^^^),  and  hence 
there  exists  C  I2  such  that 

I  >  iiTs  >  0, 

where  K5  depends  only  on  ^a-  However,  A]”  ^Bi^  —  0,  so  that  by  (9)  we  have 

"  |sm(@i  +  92)|''^"'"’^'°'  “  |sin(ei+e2)l’ 

which  is  unbounded  as  61  ^  62  tt.  In  view  of  the  Markov  inequality,  it 
follows  that  [[BiQlloo  is  unbounded.  But  then  the  basis  cannot 

be  stable.  □ 
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§6.  Remarks 

Remark  1.  Stable  local  bases  are  important  for  both  theoretical  and  practical 
purposes.  For  example,  it  can  be  shown  (see  [12])  that  if  a  spline  space  has 
such  a  basis,  then  it  has  full  approximation  power.  Applications  where  stable 
bases  are  useful  include  data  fitting  and  the  numerical  solution  of  boundary- 
value  problems. 

Remark  2.  For  d  >  5,  stable  local  bases  for  certain  superspline  subspaces  of 
5i(A),  can  be  constructed  using  classical  finite  elements,  see  [15].  However, 
it  is  also  important  to  have  such  bases  for  the  full  spaces  »Sj(A),  since  in 
contrast  to  supersplines,  they  are  nested,  i.e.,  <5^(Ai)  C  5^(A2)  whenever 
A2  is  a  refinement  of  Ai.  This  is  important  for  multiresolution  applications, 
see  [3,13]. 

Remark  3.  Algorithm  4  is  a  modification  of  the  algorithm  used  in  [14]  to 
construct  a  star-supported  basis  for  The  only  change  is  in  the  choice 

of  nodal  functionals  in  step  4b)  where  io  was  taken  to  be  any  index  such  that 
Cio  is  nondegenerate  at  v.  To  get  stability,  we  have  to  choose  iq  more  carefully. 
The  basis  constructed  in  Algorithm  4  is  not  locally  linearly  independent.  To 
get  an  LLI  basis,  step  4)  has  to  be  modified  in  a  different  way,  see  [4]. 

Remark  4.  Star-supported  bases  were  constructed  for  general  spline  spaces 
5J(A)  for  d  >  4r-\-l  in  [1],  and  for  d  >  3r-\-2  in  [10,11].  The  constructions  were 
based  on  Bernstein-Bezier  techniques,  and  are  not  stable  for  triangulations 
that  contain  near-degenerate  edges  and/or  near-singular  vertices. 

Remark  5.  In  [7]  we  use  Bernstein-Bezier  techniques  to  construct  stable  local 
bases  for  general  spline  spaces  55(A)  and  their  superspline  subspaces  for  all 
d  >  3r  -i-  2.  In  a  related  work  [6],  we  also  used  Bernstein-Bezier  techniques 
to  construct  locally  linearly  independent  bases  for  the  same  range  of  spline 
spaces  and  superspline  spaces.  For  more  on  LLI  spaces,  including  applications 
to  almost  interpolation,  see  [2, 4,6, 8, 9]. 

Remark  6.  Following  the  arguments  in  [7],  it  is  easy  to  show  that  a  natural 
renorming  of  our  stable  bases  is  Lp-stable  for  all  p  G  [l,oo]. 
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On  Lacunary  Multiresolution  Methods 
of  Approximation  in  Hilbert  Spaces 

Lubomir  T.  Dechevski  and  Wolfgang  L.  Wendland 


Abstract.  We  study  lacunary  multiresolution  methods  from  the  point 
of  view  of  their  analogy  to  the  use  of  near-degenerate  elements  in  finite 
and  boundary  element  methods.  The  main  results  are  characterization  of 
the  best  N-term  approximation  of  solutions  of  nonlinear  operator  equations 
and  best  N-term  approximation  by  near-degenerate  normal  approximating 
families  in  Hilbert  spaces. 


§1.  Introduction 

This  communication  is  part  of  a  sequence  of  papers  exploring  the  use  of  near¬ 
degenerate  elements  in  finite-  and  boundary-element  methods  (see  also  [5,6] 
and  their  wavelet-based  analogues,  lacunary  multiresolution  methods.  The 
use  of  near-degenerate  and  lacunary  methods  for  solving  operator  equations 
is  of  considerable  practical  significance  because  in  many  important  problems 
arising  in  industry,  engineering  and  natural  sciences,  the  use  of  such  methods 
leads  to  a  dramatic  reduction  of  execution  time  and/or  computer  resources. 
The  theoretical  justification  for  the  use  of  such  methods  is,  however,  very 
challenging:  it  has  been  successfully  carried  out  only  in  a  number  of  special 
cases,  by  specific  techniques  which  vary  from  case  to  case.  The  purpose  of 
this  sequence  of  papers  is  to  develop  a  general  approach  to  overcoming  the 
challenges  of  the  use  of  lacunary  multiresolution  and  near-degenerate  finite 
and  boundary  element  methods.  Because  of  the  limited  space  available,  we 
shall  consider  only  multiresolution  methods  for  operator  equations  in  Hilbert 
spaces,  with  an  outline  of  the  main  ideas  of  the  proofs,  which  in  the  Hilbert- 
space  case  are  simpler  and  relatively  short.  A  much  more  technically  involved 
and  detailed  discussion  of  both  near-degenerate  finite  elements  and  lacunary 
multiresolution  methods  and  the  important  parallel  between  them  will  be 
given  for  more  general  types  of  nonlinear  operators  in  quasi-Banach  spaces  in 
a  later  paper. 
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§2.  Approximate  Solutions  of  Nonlinear  Operator  Equations 

In  this  section  we  consider  a  general  class  of  nonlinear  operator  equations, 
and  study  the  numerical  solutions  of  these  equations  obtained  by  iterative 
and  projection  methods. 

Let  X,  Y  be  real  Hilbert  spaces.  The  class  of  nonlinear  operators  to  be 
considered  is  the  space  LH{X,Y)  of  all  Lipschitz  homeomorphisms  F  between 
A  and  y,  that  is,  3  on  Y  and  3  C(F,X,Y)  <  oo  :  ||F(a;i)  -  F(x2)||y  < 
C\\xi  —  X2\\x,  Va:iVa;2  E  X^  and  analogously  for  F~^. 

Let  be  a  Hilbert  space,  such  that  X  r\Y  C  H  and  A”  H  T  is  dense  on 
FT,  and  let  Y  be  the  dual  of  X  pivotal  to  /f,  i.e.,  the  dual  with  respect  to  the 
duality  functional  defined  by  the  scalar  product  of  H.  We  shall  denote  this 
dualby  r  =  A*  ^X*{H). 

Definition  1.  Let  Y  =  X*{H).  The  (generally  nonlinear)  operator  F  :  X 
X*  is  called  Lipschitz,  if 

3C(F,X,H)<,x>-.\\F{xi)-F(x2)\\x-  <C\\x,-x2\\x,  (1) 

Va:iVa:2  E  X,  and  strongly  monotone,  if 

3c{F,X,H)  >  0  :  {F{xi)  -  F{x2),xi  -  X2)h  >  c\\xi  -  X2l\x:  (2) 

\/xiVx2  E  X.  The  class  LSM  =  LSM{X,  H)  consists  of  exactly  those  F  : 
X  — >  X*{H)  which  satisfy  (1,2). 

It  can  be  shown  that  the  constants  C  and  c  in  (1,2)  are  related  by  c  <  (7. 
It  should  be  noted  that  the  typical  case  here  is  X  H  X*  or  X 
H  ^  X*,  where,  as  usual,  B  or  B  A  denotes  continuous  embedding: 
Ac  B  and  ||.||b  <  CH-Ha. 

Theorem  1.  (Generalization  of  Theorem  18.5  in  [11]  and  strengthening  of 
Theorem  18.5  in  [15]  for  the  case  of  Lipschitz  operators  in  Hilbert  spaces,  j  Let 
X  and  H  be  Hilbert  spaces  with  the  same  cardinality.  Then,  LSM(X,  H)  C 
LH(X,X*(H)). 

Proof:  (Outline.)  By  duality  arguments,  it  can  be  shown  that  the  cardi¬ 
nality  of  X*{H)  is  equal  to  that  of  X  and  H.  Therefore,  since  all  spaces 
are  Hilbertian  with  the  same  cardinality,  there  exist  linear  invertible  oper¬ 
ators  R  :  H  X  and  S  :  H  X*{H)  which  are  isometric  together 
with  their  inverses.  Hence,  the  equation  F(x)  =  y,  x  e  X,  y  €  X*{H)  is 
equivalent  to  the  equation  Av  =  w,  v,w  e  H,  where  Av  =  S~^FR.  Now, 
since  F  G  LSM{X,  H),  it  follows  from  //  =  ||i2||//_x  =  1,  that 

A  G  LSM{H,  H).  Therefore,  by  Theorem  A  (see  below),  F  is  bijective  from  X 
to  X*.  By  a  condition  of  the  theorem,  F  is  Lipschitz;  it  remains  to  prove  the 
same  for  Indeed,  by  the  strong  monotonicity  of  F,  setting  xi  =  F~^(yi), 
X2  =  F-'^{y2),  VyiV7/2  G  X* ,  we  get 

-  F-'(!/2)I&  <\{yi-  3/2.  F-\y,)  -  F-\y^))H 

which  completes  the  proof.  □ 
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Theorem  A.  (See  [11],  Theorem  18.5).  Assume  that  X  =  X*  =  H.  Then, 
LSM(H,H)  C  LH{H,H)  and  the  operator  Te,y{x)  =  x-e[F{x)-y],  x  e  H, 
is  contractive  in  H  for  0  <  e  <  uniformly  in  y  £  H,  where  C  and  c  are 
deSned  in  (1,2).  The  best  contraction  factor  is  1  -  and  is  achieved  for 

£  =  010“^. 

Following  the  idea  of  the  proof  of  Theorem  1,  Theorem  A  can  be  modified 
for  the  case  when  X  ^  H.  We  omit  the  details. 

In  the  remaining  part  of  this  section  we  shall  consider  methods  for  ap¬ 
proximate  solution  of  the  equation  F{x)  =  y,  x  £  X,  y  £Y,  where  X,  Y  are 
Hilbert  spaces. 

Definition  2.  (See  [12]j.  Let  X  be  a  Hilbert  space.  G  C  X  is  called  an 
existence  set  for  X,  if\fx  £  X  3gx  €  G  :  ||a?  -  gx\\x  =  minpeG  Ik  “  9\\x  = 
Eg{x)x-  (The  best  approximation  gx  need  not  be  necessarily  unique.)  The 
sequence  {Gn}n=i^  Gn  C  X  is  called  a  normal  approximating  family  in  X,  if 
for  any  iV  G  IN,  Gn  is  an  existence  set,  with  Gn  C  Gn+i  s,nd  Gn  —  Gn-i  C 
G2N- 

Obviously,  an  existence  set  in  X  is  closed  in  X  (typical  example:  any 
finite-dimensional  subspace  of  A). 

Definition  3.  Let  X  be  a  Hilbert  space.  The  sequence  {Gn}n=i  •  ^  ^ 

is  said  to  have  the  strong  approximation  property  (SAP,  for  short)  if\JN=i 
is  dense  on  X  in  the  inner-product  topology  of  X. 

Let  us  consider  now  the  Galerkin-Petrov  projection  methods.  Let  Pn  : 
X  ^  X,  Qn  :  X  ^  X  he  projectors  with  dim  Pn{^)  =  dim  Qn{X)  =  N, 
and  PnPn+1  =  Pn+iPn  =  Pn,  QnQn+i  =  Qn+iQn  =  Qn- 

Example  1.  (Galerkin-Petrov  method  for  monotone  operators.)  For  a  Hilbert 
space  X,  let  Y  =  X*(H).  The  equation  jP(a:)  =  y,  x  £  X,  y  £  X*,  is  replaced 
by  Q*nF{Pnx)  =  Q*Ny,  where  :  X*  ^  X*,dim  Q*n{X*)  =  N,  is  the 
Banach  adjoint  of  Qn-  The  N  x  N  nonlinear  system  is  determined  by 

{Q*nF(Pnx),  QNh)H  =  {QnV,  QNh)H-  (3) 

By  Lemma  23.1  in  [15],  it  follows  that  if  F  G  LMS{X,  H),  where  X  and 
H  are  separable,  then  (3)  has  a  unique  solution  for  N  large  enough. 

In  the  case  X  =  X*  —  H ,  if  N  is  large  enough,  so  that  QpfF(pNH)  = 
QnH  holds,  then,  by  Theorem  A,  (3)  can  be  computed  by  quickly  converging 
contractive  iterations.  For  small  N,  the  condition  Q*^F{PnH)  —  Q*nH  may 
fail  even  if  F  is  linear  and  Pn  =  Qn  (see  [2],  Theorem  10.1.1). 

If  F  is  twice  Gateau-differentiable,  then  Newton’s  method  can  be  used 
where  the  inverse  matrix  involved  in  each  iteration  is  usually  sparse.  In  gen¬ 
eral,  this  method  needs  an  appropriate  initial  approximation  xq  to  the  solution 
of  F(x)  =  y,  but  if  F  is  strongly  monotone  and  potential,  that  is,  if  there  exists 
a  real  functional  /  :  X  — >  H  such  that  F  =  grad  /,  then,  by  Theorem  5.1  in 
[15] ,  /  is  strictly  convex  and  the  solution  of  (3)  is  equivalent  to  minimizing  the 
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three  times  Gateau-difFerentiable  functional  /.  Hence,  Newton’s  method  con¬ 
verges  to  the  solution  of  (3)  for  any  a:o  G  X ,  the  rate  of  convergence  depending 
on  the  constant  c  in  (2).  This  technique  is  still  numerically  efficient  if  F  is  only 
Lipschitz,  and  Newton’s  method  or  its  various  modifications  be  replaced  by 
the  respective  variants  of  the  more  general  F.  Clarke’s  subdifferential  method. 
In  the  case  of  potential  F,  the  Bubnov-Galerkin  method  (P^r  =  Qn)  coincides 
with  the  Ritz  method  for  minimization  of  /. 

For  projection  methods  (see  Example  1),  the  strong  approximation  prop¬ 
erty  can  be  written  as  lim;v-»c5o  IK-fx  “  Pn)^\\x  =  0.  A  typical  example 
when  Gn  =  Pn^  forms  a  NAF  having  the  SAP  is  when  Pjv  is  obtained  by 
multiresolution. 

By  Theorem  23.3  in  [15],  if  X  is  separable,  then  Gn  —  PnX,  as  defined 
in  Example  1,  has  the  SAP;  by  Lemma  23.1  in  [15]  and  in  view  of  P  G 
LMS{X,H),  the  solution*  of  (3)  exists  for  N  large  enough  and  - 
^\\x  — ^  0,  where  x  is  the  solution  of  F{x)  =  y,  y  £  X* . 

Theorem  B.  (Cea’s  lemma  for  nonlinear  operators  (see  [13],  Lemma  2.8; 
[11],  Theorems  4.1  and  18.8.)  Under  the  assumptions  of  Example  1,  for  F  e 
LSM{XyH),  let  X  =  F~^{y)  e  X  be  the  solution  of  F{x)  =  x/  G  X*{H),  and 
Xn  ^  X  be  the  solution  of  (3).  Then, 

3C{F,X,II)  <  00  :  EMiF-\y))x  <  IlfH!/)  -  <  CEN(F-'(y})x. 


This  result  shows  that  Galerkin-Petrov  methods  (of  any  type  -  finite  el¬ 
ement  or  wavelet)  achieves  the  best  approximation  rates  up  to  a  constant 
factor. 


§3.  Best  jN-term  Approximation 

For  the  general  paradigm  of  best  N-term  approximation  (BNTAP)  we  refer 
to  [12],  section  3.5,  and  [8]. 

Definition  4.  Let  Xj,  Yj,  j  =  0,1  be  Hilbert  spaces,  Xi  ^  Xq,  Yi  ^  Yq, 
and  let  F  G  LHiXQ,Yo)  n  LH{Xi,Yi).  The  NAF  {Gn}^=.i  :  Gn  C  JAi,  is 
called  near-degenerate  of  order  (A;  a,/?),  A  >  0,  a  >  0,  P  >  0,  if  it  satisfies  a 
direct  inequality  of  the  type 


3C  <  00  :  Vj/eKi,  (4) 

where  C  =  C{N),  with  C  x  N^;  and  an  inverse  inequality  of  the  type 

3D<oo:  ||x|U,  <  x  €  Gk,  (5) 

where  D  =  D{N),  with  D  x  .  The  partial  case  a  =  f3  =  0  corresponds  to 
a  non-degenerate  (regular)  NAF. 
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CO 

A»(Xo)  :=  {feXo:  H/IUjfXo)  =  +  <  00} 

j=0 

(6) 

and  the  real  interpolation  space 

:=  {f  l|/||(yo,yi)e,,  = 

<  00}. 

3=0 

where  K{t,  f;Yo,Yi)  is  Peetre’s  i^-functional  (see  [2,12]),  0  <  t  <  oo,  s  > 
0,  O<0<1,  0<g'<oo  (with  the  usual  sup-modification  in  (6,7)  for 
q  =  oo).  (Recall  that  ^  Xq,  which  explains  the  presence  of  the  saturation 
term  ||/||3f^  in  (6,7).) 

Theorem  2.  (Characterization  of  the  best  N-term  approximation  of  solu¬ 
tions  of  nonlinear  operator  equations  by  near-degenerate  NAF  in  Hilbert 
spacesj.  Assume  that  the  conditions  of  Dehnition  4  hold.  Let  0  <  q  <  oo. 
Then, 

(i)  if  0  <  a  <  A  and  s:0<5<A  —  a,  then,  3Ci  <  oo  : 

l|J^~^(2/)IU^(Xo)  <  +  lbll(yo.yi)^,J;  (8) 

(ii)  if  /?  >  0  and  0  <  s  <  A  -f  /?,  then  3(72  <  co  : 

lly|l{yc,n)^„  <  C2[||f  (0)||y,  +  ||F-i(y)|U3(x„)l.  (9) 

Proof;  (Outline.)  By  a  standard  technique,  typical  for  BNTAP  (see  [12], 
Theorem  3.16  and  Corollary  3.7),  we  prove 

l|.F'"Hy)IUj(Xo)  <  Ci||.f’“'(2/)ll(Xo,Xi)^,,>  (10) 

l|f-‘(!/)ll(x.,xO^,.<«2||F-^(y)IU3(X„).  (11) 

By  obtaining  appropriate  upper  bounds  for  the  K-functionals  in  the  definition 
of  (Xo,  Xi)e^g  and  (Iq,  ^1)0,95  0  <  0  <  1,  and  using  the  embeddings  Xi  Xq, 
Yi  Yb,  it  can  be  shown  that,  for  Lipschitz  operators  F,  F~^, 

l|f’-'(y)ll(x„,xo^..  <  <^3(l|f'‘*(0)IU.  +  l|y||(y„,yOrf^.,)-  (12) 

lin*)ll(yo,yo^„  ^  <=4(||f (O)lln  +  lkll(x„.xo^J  (13) 

hold.  Combining  (10)  with  (12)  and  (13)  with  (11),  we  arrive  at  (8)  and  (9), 
respectively.  □ 
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Corollary  1.  Under  the  conditions  of  Theorem  2,  let  0  <  s  <  X  ~  a,  and 
assume  that  F(0)  =  Oyj,  F”^(0)  =  Oxi-  Then, 

A\(X^)  (yo,Y,)^,,.  (14) 

In  particular,  if  a  =  /?  =  0,  then 

(yo,n)^,  =  ^“(^o)  (15) 

(isomorphism  of  the  spaces,  equivalence  of  the  Hilbert  norms). 

Note  that  this  special  case  corresponds  to  sublinear  operators. 

Remark.  If  the  dependence  of  C  in  (4)  and/or  D  in  (5)  on  N  is  weaker  than 
polynomial,  e.g.,  logarithmic,  then  the  left-hand  and/or  right-hand  embedding 
in  (14)  can  be  sharpened  by  setting  a  =  0  and/or  /3  =  0  and  modifying  the 
index  q.  We  omit  the  details. 

Multiresolution  Galerkin-Petrov  methods  for  monotone  operators  (Ex¬ 
ample  1)  are  included  as  partial  cases  in  Theorem  2  and  Corollary  1.  For 
monotone  operators,  we  have  Yq  =  Xq{Ho),  Yi  =  where  Hq,  Hi 

are  Hilbert  spaces  with  Hq  ■<— ’  Hi  which  are  sufficiently  far  away  from  each 
other  so  that  Xq  Xi  and  Xq{Hq)  XI {Hi)  hold  simultaneously.  Here 
n  Yi  is  assumed  to  be  dense  in  Hq  and  Hi.  The  projectors  and  Qat 
in  Example  1  are  assumed  generated  by  multiresolution,  which  ensures  that 
Cat  —  Gn-1  C  G2N- 

In  the  rest  of  this  section  and  in  the  next  section  we  shall  discuss  how 
to  reduce  the  rates  a  and  (3  in  Theorem  2  and  Corollary  1  to  zero  in  the 
presence  of  near-degeneracy.  To  this  end,  we  shall  study  the  analogue  of 
the  phenomenon  of  near-degeneracy  with  multiresolution  methods  based  on 
biorthogonal  wavelets. 

One  equivalent  norm  in  the  Inhomogeneous  potential  spaces  H^  (cf.,  e.g., 
[14,4]  for  p  =  g  =  2)  is  given  by 


oo  2”-l 

ii/iu-  X  { Y,  E  E  (16) 

fcez"  j=o  ifcez"  i=i 

with  0  <  5  <  r,  where  in  [8]  r  is  the  Lipschitz  regularity  of  the  compactly 
supported  scaling  functions  (p  E  H'' ,  (p  E  H^  and  wavelets  E  H^ ,  ”0^^]  E 
H^  of  the  biorthonormal  wavelet  bases,  with  respect  to  which  f  E  H^  can  be 
expanded  as  follows: 


oo  2"-l 

/(®)  =  E  <^ok<pok{x) + E  E  E  (11) 

fccz;"  j=okc'z^  1=1 

where  aofc  =  {f,<Pok)L2,  j0jfc)L2-  Each  hypercube  in  the  Calderon- 

Zygmund  decomposition  of  and  Stein’s  construction  of  Whitney-type 
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extension  operators  corresponds  to  2"  —  1  basis  functions  I  = 

—  1,  in  each  of  the  two  biorthonormal  bases.  The  convergence  in 
(17)  is  in  the  norm  topology  of  but  also  Lebesgue  a.e.  on  the  domain  Q 
of  the  functions.  0  may  be  K,”,  hyperrectangle,  correspond  to  the  periodic 
case,  or  even  general  Lipschitz-graph  domain.  We  refer  to  the  currently  most 
advanced  work  on  this  topic  [3] ,  as  well  as  to  the  extensive  account  [4]  (for  the 
case  of  homogeneous  potential  spaces,  see  [7],  in  the  special  case  p  =  q~2). 

Definition  5.  Let  ji  G  IN.  A  non-degenerate  wavelet-based  projector  (NWP) 
is  denoted  by  and  defined  by 


k  j=0  k  i=l 

cf.  (17).  A  near-degenerate  wavelet-based  projector  (NDWP)  is  denoted  by  Ph 
and  defined  by 

2"-l 

k  k  j=0  1=1 

J{ji,k)  >  J{ji  —  1,  A;),  J{jiyk)  ^  ji',  ^kj^  :  J{jiykj^)  —  ji-  (20) 

In  other  words,  for  a  NWP  J{jitk)  =  ji  =  const,  uniformly  in  k.  Thus, 
NDWP’s  are  a  specific  partial  case  of  lacunary  wavelet-based  projectors  (see 
the  concluding  remarks  in  [4],  subsection  6.2),  lacunarity  being  with  respect 
to  the  NWP  corresponding  to  Ji  :=  maxfc  J{ji,  k). 

Example  2.  One  example  when  near-degenerate  FEM  or  lacunary  wavelet- 
based  projectors  of  NDWP  type  are  needed  is  in  the  error  analysis  of  numerical 
solutions  in  the  immediate  neighbourhood  of  the  boundary  dft  (see,  e.g.,  [11], 
Fig.  3.14,  3.15,  6.14,  6.15,  8.12).  Then  it  is  desirable  to  ensure  that  the  local 
approximation  rates  near  and  on  dQ  do  not  deteriorate  compared  to  the  local 
approximation  rates  in  the  interior  of  (1.  Indeed,  assume  that  dO,  is  regular 
enough  (Lipschitz  or  smoother).  Then,  by  the  trace  theorem  (see,  e.g.,  [1,9]), 
if  /  G  if®(0),  ft  C  m”,  then  the  restriction  of  /  on  is  less  regular,  namely, 
f\dn  ^  holds.  Then,  the  local  approximation  rate  achieved  via 

NWP,  given  in  (18),  is  0(2“^^®)  in  the  interior  of  ft  and  only  0(2“-^^^®“^/^^) 
near  50.  To  achieve  the  desired  uniform  distribution  of  the  error  in  the  interior 
and  near  the  boundary  when  /  is  smooth  enough  [s  >  1/2),  NDWP  given  in 
(19,20)  should  be  employed,  with  J{ji,k)  ^  ji  for  k  corresponding  to  the 
interior  of  0,  and  with  J{jiyk)  x  Ciji  C2  otherwise,  where 

In  the  context  of  Theorem  2  and  Corollary  1,  if  Xj  =  i7®J ,  j  =  0, 1,  with 
So  <  Si  so  that  Xi  Xq  is  fulfilled,  then  it  can  be  verified  that  for  NWP 
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both  the  direct  inequality  ||/  -  Pjj/||xo  ^  C'i2“'^^^||/||xi  and  the  inverse 
inequality  ||-Pji/||xi  <  ^22^^^ll^ji/llxo  hold,  with  A  =  Si  ~  5o  and  with 
Cl,  C2  independent  of  j\.  Hence,  in  Definition  4  a  =  ^  =  0  is  attained.  On 
the  contrary,  for  NDWP  satisfying  (21)  the  constants  Ci  and  C2  depend  on 
ji  and  a  >  0,  ^  >  0  holds. 

In  the  case  of  NDWP,  is  it  possible  to  somehow  reduce  a  and  /?  to  zero, 
thereby  achieving  isomorphism  in  (14)?  It  turns  out  that  the  answer  is  posi¬ 
tive,  and  below  we  shall  propose  a  general  method  how  to  achieve  this. 

Our  approach  will  be  consider  more  general  spaces  Xq,  Xi  than  ,  so 
that,  for  the  new  X^  and  Xi,  a  =  /3  =  0  holds.  Consider  the  Hilbert  space 
with  norm 


00  2"-! 

k  j=0  k  1=1 

The  spaces  from  this  scale  still  admit  atomic  decomposition  via  the  same 
Riesz  bases  of  biorthonormal  wavelets  as  The  weight  w{j,k)  is  positive, 
monotonously  increasing  function  in  j  for  each  fixed  k,  and  depends  on  the 
choice  of  J{ji,k)  in  (20).  The  definition  of  w{j,k)  is 

w{J{juk),k)=ju  (22) 

^(i,^=ii-l.  j  ~  J{ji J{ji -l,k) -\-l,  J{ji,k)  -  1,  (23) 
Vii  €  IN  Vfc  G  ZZ". 

Now,  take  Xj  =  j  =  0,1,  with  sq  <  ^i.  It  can  be  seen  that 

Xi  ^  Xq  holds,  and  we  can  consider  this  pair  of  spaces  in  the  context  of 
Theorem  2  and  Corollary  1. 

Corollary  2.  Under  the  conditions  of  Corollary  1,  assume  that  Xj  = 
j  =  0, 1,  where  w  =  w(j,  k)  is  the  left  inverse  (see  (22,23))  of  J{j,  k)  as  defined 
in  (20).  Assume  also  that  N  =  2^  and  Gn  =  Pj^Xq,  where  the  NDWP  Fj, 
is  defined  in  (19),  with  the  same  J{j,  k)  in  (20).  Let  s  :  0  <  s  <  X  =  si  —  sq- 
Then  (15)  holds. 

Proof:  (Outline.)  It  can  be  verified  that  the  bounds 

00 

ll/-4/ll«>o-<(E  E  ^24) 

j=ii  k:w{j,k)>ji  I 

||4/IIh...»<(EI«o.|^  +  E  E  (25) 

k  j=^  k:w{j,k)<ji  I 

hold.  (Recall  that  ji  =  minjt  J{ji,k),  Ji  =  max^  J{ji,k).)  After  some  com¬ 
putations,  (24)  and  (25)  imply 

11/  -  4/IUo  <  Ci2-^>^||/|U„  V/  €  Xi. 


(26) 
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II4/IU.  <  C22«^||4/||x„,  V/  €  Xo,  (27) 

with  X  =  Si  -  So,  and  the  constants  Ci  and  C2  in  (26,27)  do  not  depend  on 
ji,  i.e.,  for  this  choice  of  the  spaces  Xq,  Xi  in  Definition  4  o;  =  /5  =  0  holds. 
The  result  now  follows  from  Corollary  1.  □ 

Thus,  we  have  solved  the  problem  of  characterizing  the  best  approxima¬ 
tion  spaces  induced  by  NDWP  defined  in  (19)  and  (20).  In  this  approach  we 
remained  entirely  within  the  classical  BNTAP.  There  is  also  another  approach 
which  goes  beyond  the  general  BNTAP,  by  abandoning  the  use  of  the  real 
interpolation  functor.  This  approach  leads  to  atomic  decomposition  of  Wiener 
amalgam  spaces  and  will  be  considered  elsewhere. 
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Abstract.  We  address  the  problem  of  controlling  the  curvature  of  a 
Bezier  curve  interpolating  a  given  set  of  data.  More  precisely,  given  two 
points  M  and  AT,  two  directions  u  and  v,  and  a  constant  fc,  we  would 
like  to  find  two  quadratic  Bezier  curves  Fi  and  r2  joined  with  continuity 
and  interpolating  the  two  points  M  and  N,  such  that  the  tangent 
vectors  at  M  and  N  have  directions  u  and  v  respectively,  the  curvature  is 
everywhere  upper  bounded  by  fc,  and  some  evaluating  function,  the  length 
of  the  resulting  curve  for  example,  is  minimized.  In  order  to  solve  this 
problem,  we  first  need  to  determine  the  maximum  curvature  of  quadratic 
Bezier  curves.  This  problem  was  solved  by  Sapidis  and  Frey  in  1992.  Here 
we  present  a  simpler  formula  that  has  an  elegant  geometric  interpretation 
in  terms  of  distances  and  areas  determined  by  the  control  points.  We 
then  use  this  formula  to  solve  the  variant  of  the  curvature  control  problem 
in  which  Fi  and  F2  are  joined  with  continuity  where  the  length  a 
between  the  first  two  control  points  of  Fi  is  equal  to  the  length  between 
the  last  two  control  points  of  F2,  and  where  a  is  the  evaluating  function 
to  be  minimized. 


§1.  Introduction 

An  important  problem  in  CAGD  is  the  construction  of  curves  interpolating 
given  sets  of  data  that  also  satisfy  constraints  on  their  curvature.  Such  curves 
are  visually  pleasing  and  are  said  to  be  “fair”  [1,2].  Fair  curves  are  also  impor¬ 
tant  in  the  design  of  highways,  railways  and  trajectories  of  mobile  robots  (see 
[9]  and  [6]).  In  these  applications,  curvature  continuous  curves  with  bounded 
curvature  are  desirable.  Constructing  fair  curves  has  been  the  subject  of  recent 
research;  see,  for  example,  [4,5,7]  for  results  about  constraining  the  curvature 
at  the  endpoints,  and  [3,8]  for  results  about  monotonicity  of  curvature. 

In  this  paper  we  consider  the  problem  of  controlling  the  curvature  along 
the  whole  length  of  a  Bezier  curve  interpolating  a  given  set  of  data.  More 
precisely,  given  two  points  M  and  N,  two  directions  u  and  v,  and  a  constant 
k,  we  want  to  find  two  quadratic  Bezier  curves  Fi  and  r2  joined  with  continuity 
and  interpolating  the  two  points  M  and  N,  such  that  the  tangent  vectors 
at  M  and  N  have  directions  u  and  v  respectively,  the  curvature  is  everywhere 
upper  bounded  by  k,  and  some  evaluating  function,  the  length  of  the  resulting 
curve  for  example,  is  minimized.  We  call  this  the  curvature  control  problem. 
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In  order  to  solve  this  problem,  we  first  need  to  determine  the  maximum 
curvature  of  quadratic  Bezier  curves,  that  is,  to  find  an  exact  formula  in 
terms  of  the  control  points.  Note  that,  for  our  problem,  it  is  not  sufficient  to 
compute  the  maximum  curvature  of  a  particular  Bezier  curve  using  numerical 
methods.  Note  also  that  a  quadratic  Bezier  curve  is  a  parabola  and,  although 
it  presents  no  special  difficulties  to  compute  the  maximum  curvature  of  a 
parabola  in  terms  of  the  coefficients  of  its  implicit  equation,  what  we  require 
is  a  formula  in  terms  of  the  control  points. 

In  [8],  Sapidis  and  Frey  give  a  formula  for  finding  the  maximum  curvature 
for  quadratic  Bezier  curves.  In  Section  2,  we  recall  these  results  and  present 
a  simpler  formula  that  has  an  elegant  geometric  interpretation  in  terms  of 
distances  and  areas  determined  by  the  control  points.  We  then  use  this  formula 
to  solve  variants  of  the  curvature  control  problem.  Definitions  and  motivations 
for  these  variants  are  presented  in  Section  3.1.  We  solve  in  Section  3.2  the 
version  of  the  curvature  control  problem  where  Fi  and  r2  are  joined  with 
continuity  where  the  length  a  between  the  two  first  control  points  of  Fi 
is  equal  to  the  length  between  the  two  last  control  points  of  F2,  and  where  a 
is  the  evaluating  function  to  be  minimized.  In  Section  3.3,  we  prove  that  if  we 
require  in  the  previous  variant  a  continuity  instead  of  at  the  junction 
point,  then  there  exist  non-degenerate  data  for  which  there  is  no  solution  to 
the  curvature  control  problem.  However,  if  a  solution  exists,  we  show  how  it 
can  be  computed. 

Throughout  the  paper,  curvature  refers  to  non-signed  curvature,  unless 
otherwise  indicated.  We  denote  by  |jp^li  the  distance  between  points  p  and  q, 
and  by  “  x  ”  and  “ the  outer  and  inner  products,  respectively,  between  two 
vectors. 


§2.  Maximum  Curvature  of  Quadratic  Bezier  Curves 

Let  F  be  a  quadratic  Bezier  curve  with  control  points  po,  pi  and  p2  (see 
Figure  1).  Recall  that  F  is  defined  for  every  t  in  [0, 1]  by  T(t)  =  (1  -  t)^Po  + 
2t(l  -  t)pi  -h  t^P2-  Let  A  be  the  area  of  the  control  triangle  P0P1P2  and  m 
be  the  midpoint  of  the  segment  poP2-  We  assume  that  F  does  not  degenerate 
into  a  line  segment,  i.e.,  po,  Pi  and  p2  are  not  collinear. 

Theorem  1.  The  maximum  curvature  of  a  quadratic  Bezier  F  is  either  equal 
to  ||pimlP/v4^  if  Pi  lies  strictly  outside  the  two  disks  of  diameter  pom  and 
mp2,  or  is  equal  to  max{Koj«^i}  where  kq  =  -4/||poPill^  and  ki  =  .4./l|piP2|P 
are  the  curvature  of  F(t)  at  the  endpoints  F(0)  and  F(l). 

Before  proving  Theorem  1,  we  recall  the  result  by  Sapidis  and  Frey  [8] 
characterizing  quadratic  Bezier  curves  with  monotone  curvature. 

Theorem  2  [8].  The  quadratic  Bezier  curve  F  has  monotone  curvature  if 
and  only  if  one  of  the  angles  /.{popim)  and  l(mpip2}  is  equal  to  or  larger 
than  |.  In  other  words,  F  has  monotone  curvature  if  and  only  ifpi  lies  on  or 
inside  one  of  the  two  circles  having  as  diameter  p^m  and  mp2  (see  Figure  1). 
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Fig.  1.  The  quadratic  Bezier  curve  V  has  non-monotone  curvature  because  pi 
lies  strictly  outside  the  two  circles. 

Sapidis  and  Frey  also  present  in  [8]  the  following  expressions  for  the  max¬ 
imum  curvature  of  quadratic  Bezier  curves.  When  the  curvature  is  not  mono¬ 
tone  along  r,  then  its  maximum  curvature  is  4:al/\\poB\\^,  where  (see  Figure  1), 
a  is  the  distance  between  po  and  P2j  Hs  the  distance  between  pi  and  the  line 
joining  po  and  p2>  and  Ijpo-®!!  is  the  distance  between  po  and  the  line  passing 
through  p2  and  directed  by  pipo  -f  pipi-  When  the  curvature  is  monotone 
along  r,  its  maximum  is  reached  at  one  endpoint  po  or  p2  of  the  curve,  and  is 
“  2II?^  respectively. 

We  are  now  ready  to  prove  Theorem  1.  Note  that  the  area  A  of  the 
control  triangle  poPiP2  is  equal  to  al/2.  Thus,  in  order  to  prove  Theorem  1, 
based  on  the  results  by  Sapidis  and  Frey,  it  suffices  to  prove  that  8.4/||po-Sp  = 
||pim||^/v4^  or  2A  =  ||pim||.||p0'5||-  For  completeness,  we  show  how  our  result 
is  derived  from  Theorem  2. 

We  assume  first  that  pi  lies  strictly  outside  the  two  disks  of  diameter  pom 
and  mp2.  Thus,  the  curvature  K(t),  t  €  [0, 1],  of  the  quadratic  Bezier  curve  F 
is  not  monotone  by  Theorem  2.  It  follows  that  the  maximutti  curvature  of  F 
is  obtained  when  the  derivative  of  K{t)  is  zero. 

The  first  and  second  derivatives  of  the  Bezier  curve  F  are 

F'(t)  =  2((1  -  t){pi  -  Po)  +  t{p2  -  pi)) 

=  2(pi  —  Po)  +  2t(p2  -  2pi  +  Po),  (1) 

T”{t)  =  2(p2  —  2pi  -I-  Po).  (2) 

The  curvature  of  F  at  T(t)  is  thus,  for  any  t  G  [0, 1], 

^  \r\t)xr'(t)\  ^  |4(pi-po)x(p2-pi)| 

||F'(t)f  ||F'(t)||^ 


giving 


K{t)  = 


SA 

r'Wf’ 


(3) 
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where  A=  \{pi  -  Po)  x  (P2  -  Pi)  1/2  is  the  area  of  the  control  triangle  poPiP2- 
The  derivative  of  K,{t)  is 

^  -24.4(||r'(f)||)'  ^  -12.4(||r(t)fy 

rwr  lir'Wf 

Since  we  assumed  that  the  Bezier  curve  T  is  not  degenerate,  po,  pi  and  p2  are 
not  collinear  and  thus  A  ^  0-  Thus,  =  0  if  and  only  if  (||r'(^)||  )'  =  0, 
or  alternatively,  T'(t)  •  r"(t)  =  0.  Using  (1)  and  (2),  we  get 

r'{t)  •  r”{t)  =  4[{p2  -  2pi  +  pQ)t  +  {pi  -  po)l  *  [p2  -  2pi  +  Pol  =  4{at  -  /?), 

where  a  =  \\p2  -  2pi  +  Po||^  and  P  =  -{pi  -  po)  •  (p2  “  2pi  +  po)- 

Thus,  the  derivative  of  the  curvature  K{t)  vanishes  if  and  only  if  i  =  r  = 
P/a.  Note  that  r  is  in  (0, 1)  because  the  curvature  of  T  is  not  monotone  by 
assumption.  Therefore,  the  maximum  curvature  along  F  is  obtained  for  t  =  t. 
2A 

Lemma  3.  ||r'(r)||  = 

Proof:  By  (1),  the  square  of  the  first  derivative  of  r(i)  at  r  is 

||r'(r)||^  =  4[(p2  -  2pi  +  po)r  +  (pi  -  po)]^  =  4(ar^  -  2tP  +  ||poPi|i^) 

=  4(a^  -  2-p  +  IIpoPiII^)  =  -(allPoPif  -  P'^)^ 
a^  a  a 

where,  as  before,  a  =  \\p2  -  2pi  4-po||^  and  P  =  -(pi  —  po)  •  (P2  —  2pi  + 
Po).  Since  P2  -  2pi  +  po  =  PiPo  +  P1P2  =  2plm,  we  get  a  =  4||pim||^, 
P  =  —  2poPi  '  PittI,  and  thus 

||r'(r)f  =  |j^^^(4||pimf  IIpoPiII^  -4{^-^)^ 

It  follows  from  the  canonical  equation  {U  x  F)^  +  {U  •  F)^  =  C/^F^,  for  any 
two  vectors  U,  F,  that 


l|r'{r)f  = 


Now,  |p^  xplm|  is  equal  to  A,  the  area  of  the  control  triangle  poPiP2-  Indeed, 
pTTtI  =  (plj^  +pIP2)/2  and  thus  |p^  x  plm|  =  |p^  x  pI^|/2  =  A.  Thus, 
||r'(r)|p  =  l\\p\m\\^  which  yields  the  result.  □ 

The  expression  of  Kmax  =  ^('r)  now  follows  easily.  By  Lemma  3,  ||r'(r)||^ 
is  equal  to  8.4^/||pim||^.  Thus,  (3)  gives 

That  ends  the  proof  of  Theorem  1  when  pi  lies  strictly  outside  the  two  disks 
of  diameter  pom  and  mp2. 
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When  Pi  lies  inside  one  of  these  disks,  Sapidis  and  Frey  (see  Theorem  2) 
proved  that  the  curvature  of  the  quadratic  Bezier  curve  F  is  monotone.  The 
maximum  curvature  is  thus  the  curvature  at  one  endpoint  r(0)  or  r(l).  Equa¬ 
tion  (1)  gives  r'(0)  =  2(pi— po)  and  r'(l)  =  2(p2-pi)-  It  then  follows  from  (3) 


that 


and 


k{1)  = 


A 

lbiP2lP’ 


§3.  Controlling  the  Curvature  of  Piecewise  Quadratic  Bezier  Curves 
3.1.  Preliminaries 

Let  Fi  and  F2  denote  two  quadratic  Bezier  curves  with  control  points  {po,pi, 
P2)  and  {qoTqi,q2)  respectively,  and  let  F  denote  the  concatenation  of  Fi  and 
F2.  The  general  curvature  control  problem  we  address  is: 

Given  two  points  M  and  N,  two  unit  vectors  u  and  v,  and  a  constant 
k,  we  would  like  to  find  two  quadratic  Bezier  curves  Fi  and  F2  joined 
with  continuity  (at  P2  ~  qo ),  interpolating  the  two  points  M  and 
N  (atpo  and  q2  respectively) ,  such  that  the  tangent  vectors  at  M  and 
N  have  directions  u  and  v,  respectively,  the  curvature  is  everywhere 
upper  hounded  by  k,  and  some  evaluating  function  is  minimized. 

We  consider  without  loss  of  generality  A:  =  1;  for  any  fc  7^  0,  we  can  obtain  an 
equivalent  problem  where  fc  =  1  by  scaling  the  plane. 

The  curves  Fi  and  F2  are  connected  (at  p2  =  ?o)  with  continuity  if 
and  only  if  there  exists  p  G  (0, 1)  such  that  p2  =  q^  =  ppi  +  (1  —  p)qi-  The 
curve  F  interpolates  M  and  N,  such  that  the  tangent  vectors  at  M  and  N 
have  directions  u  and  v,  respectively,  if  and  only  if  po  =  Q2  =  ^  and  there 
exists  a  and  /?  positive  real  numbers  such  that  pi  —  po  =  cxu  and  q2  —  qi—  (du 
(see  Figure  2).  One  way  to  solve  the  general  curvature  control  problem  is  to 

1)  find  the  set  of  (a,fd,p)  G  (0, +00)^  x  (0, 1)  on  which  the  curvature  of  F 
is  everywhere  smaller  or  equal  to  1,  and  then, 

2)  find  a  value  (a,/?, p)  in  that  set  for  which  the  evaluating  function  is 
minimized. 

In  general,  this  is  a  non-linear  optimization  problem  with  non-linear  con¬ 
straints,  and  thus,  cannot  necessarily  be  solved  quickly  and  accurately. 
Clearly,  the  difficulty  depends  on  the  complexity  of  the  set  of  feasible  solutions 
and  on  the  evaluating  function  that  is  to  be  minimized.  Here  we  consider  sim¬ 
plifying  assumptions.  First,  we  require  a  continuity  C^  at  the  junction  point 
between  the  two  curves  Fi  and  F2.  This  fixes  p  to  1/2  and  reduces  the  num¬ 
ber  of  variables  to  two.  To  bring  the  number  of  variables  down  to  one,  we 
arbitrarily  consider  a  =  p.  We  then  choose  as  evaluating  function  the  length 
a.  By  minimizing  a,  we  ensure  that  all  the  control  points  pi,  p2  =  qo  and 
qi  remain  close  to  the  the  points  M  and  N  we  want  to  interpolate;  in  other 
words,  by  minimizing  a,  we  expect  that  the  length  of  the  resulting  curve  F 
will  not  be  too  far  from  its  minimum.  With  these  further  assumptions,  we 
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Fig.  2.  Curvature  control  problem  with  continuity  and  a  ^  p. 

solve  (in  Section  3.2)  the  given  interpolation  and  minimization  problem,  ex¬ 
cept  for  the  degenerate  case  when  u  and  v  are  parallel,  for  which  we  prove 
that  a  solution  does  not  necessarily  exist. 

In  Section  3.3,  we  also  consider  a  =  p,  but  we  require  a  continuity 
(instead  of  C^)  at  the  junction  point  between  the  two  curves  Fi  and  r2.  In 
other  words,  we  require  the  signed  curvature  to  be  continuous  on  F.  The 
variables  are  then  reduced  to  but  the  constraint  that  the  continuity  is 

links  these  two  variables,  and  thus  the  problem  is  actually  one-dimensional. 
We  prove  in  Section  3.3  that  this  set  of  additional  constraints  is  too  restrictive 
in  the  sense  that  there  exists  non-degenerate  data  (M,  N,u,v)  that  cannot  be 
interpolated.  However,  if  a  solution  exists,  we  show  how  it  can  be  computed. 

3.2.  Curvature  control  problem  with  continuity 

We  consider  here  the  following  variant  of  the  curvature  control  problem: 
Given  two  points  M  and  N,  and  two  unit  vectors  u  and  v,  we  want 
to  find  two  quadratic  Bezier  curves  F i  and  F 2  joined  with  continuity 
(at  p2  =  qo),  interpolating  the  two  points  M  and  N  (at  po  and  q2 
respectively),  such  that  the  tangent  vectors  at  M  and  N  have  direc¬ 
tions  u  and  V  respectively,  the  maximum  curvature  of  the  two  curves 
is  smaller  or  equal  to  1,  the  distances  a  =  ||poPill  <^'>^d  p  =  Iki92tl 
are  equal,  and  such  that  a  is  minimized. 

See  Figure  2. 

We  show  in  this  section  how  to  solve  this  problem  for  non-degenerate 
data,  that  is  when  u  and  v  are  not  collinear.  When  u  and  v  are  collinear,  we 
show  that  there  is  not  necessarily  a  solution. 

As  we  said  in  Section  3.1,  this  problem  is  equivalent  to  finding  the  smallest 
a  e  (0, +00)  such  that  the  curvature  of  Fi  and  F2  is  everywhere  smaller  or 
equal  to  1,  where  po  =  M,  92  =  A,  pi  =  Po  +  au,  qi  —  q2  -  ocv  and 
P2  =  go  =  (pi  +gi)/2. 

We  show  how  we  compute  the  smallest  a  G  (0,  +00)  such  that  the  cur¬ 
vature  of  Fi  is  everywhere  smaller  or  equal  to  1.  Computing  the  smallest 
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a  e  (0,  +oo)  for  r2  can  be  done  similarly.  We  then  return  the  curve  T  defined 
by  the  biggest  of  those  two  a. 

First,  for  any  value  a  G  (0,  +oo),  we  need  to  determine  an  expression  for 
the  maximum  curvature  of  Fi.  By  Theorem  1,  it  remains  to  determine  whether 
the  maximum  curvature  of  Fi  is  given  by  the  maximum  curvature  (Fi) 
of  the  parabola  supporting  Fi,  or  by  «o{ri)  or  /ci(Fi),  the  curvature  of  Fi  at 
its  endpoints  Fi(0)  or  Fi(l),  respectively.  Thus,  for  any  value  a  G  (0, +oo), 
we  want  to  decide  whether  pi  belongs  to  one  of  the  disks  of  diameter  pom 
and  mp2,  where  m  is  the  midpoint  of  poP2  (see  Figure  2).  Let  c  and  c'  be  the 
respective  centers  of  these  disks  and  R  be  their  radius.  In  order  to  determine 
whether  pi  belongs  to  one  of  these  disks,  we  compute  and  compare  with 
the  distances  ||pic||^  and  ||pic'|p. 

Since  pi  and  qi  are  linear  in  a,  and  p2  =  (pi  +  9i)/2,  m  =  (po  +P2)/2, 
c  =  (po  +  rn)/2,  and  c'  =  (m  +  P2)/2,  we  have  that  (c  -  po)^,  (c  -  pi)^ 
and  (c'  -pi)^  are  of  degree  2  in  a.  Thus,  7?^  <  ||pic|p  and  R?  <  ||pic'|p 

are  inequalities  of  degree  at  most  2  in  a  (namely  a  >  and 

a^[(5u  +  3u)^  —  (u  —  u)^]  -  2Q!(16it  +  8v)  ■  pogi  +  8||po92|P  >  0).  By  solving 
these  equations,  we  get  a  partition  of  (0,  +oo)  into  two  sets  of  intervals  Z  and 
I'  such  that  the  maximum  curvature  of  Fj  is  given  by  (Fi)  for  any  a  G  Z, 
and  by  max(/«o(Fi), «i(Fi))  for  any  a  el'. 

With  .4(poPiP2)  denoting  the  area  of  the  control  triangle  P0P1P2,  we  get 
by  Theorem  1,  when  po,  pi  and  p2,  are  not  collinear. 


.(ri)^ 


llpimir 

A(poPiP2y  ’ 


«o(ri)" 


A(poPiP2y 

Ibopill® 


and  aci(Fi)^ 


>A(poPiP2)^ 

\\PlP2\\^ 


A  straightforward  computation  gives 


p^ -\r  ot{-Su  -  v)  - ^  ^  ^  p^-ct(u-\-v) 

Pim  = - - - ,  popi  =  au  and  pip2  = - 


Thus,  .4(poPip2)  =  IpoPi  X  pI^|/2  =  \au  x  p^  —  a^u  x  i;|/4  and 


..  (r  ^2  _  («^(3w  +  v)^  ~  2a{3u  +  v)  -  ^  +  ||^||^)^ 
lQ(a^u  X  V  —  au  X  po92)^ 


16“  (a{u  +  v)  -  P092)® 


Thus,  (Fi)^  <  1,  «o(Fi)^  <  1  and  Ki(Fi)^  <  1  reduce  to  inequalities 
in  a  of  degree  at  most  8,  6  and  6  respectively.  Finding  the  intervals  of  Z 
and  Z'  on  which  those  inequalities  are  satisfied  can  therefore  simply  be  done 
by  computing  the  roots  of  the  corresponding  equations.  More  precisely,  the 
smallest  of  (i)  the  smallest  root  of  Kmax{I'i)^  =  1  in  Z,  and  (ii)  the  smallest 
root  of  /to(ri)^  =  1  and  «;i(Fi)2  =  1  in  Z\  is  the  smallest  a  for  which  the 
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Fig.  3.  Case  where  poi  Pi  p2  are  collinear  and  consecutive. 

maximum  curvature  of  Fj  is  smaller  or  equal  to  1.  Such  a  solution  exists  when 
u  X  u  ^  0  because  the  maximum  curvature  of  F i  goes  from  +oo  to  0  since 
Kmax (Fl)^  ko(Fi)^  and  ki(Fi)2  tend  to  +oo  when  a  tends  to  0,  and  tend  to 
0  when  a  tends  to  +oo. 

We  have  shown  that,  when  u  x  u  ^  0,  the  smallest  a  G  (0,  +00)  such 
that  the  curvature  of  Fi  is  everywhere  smaller  or  equal  to  1,  and  such  that 
the  control  points  po,  Pi  and  p2  are  not  collinear,  exists  and  we  can  compute 
it.  Suppose  now  that  there  exists  o;  G  (0, +00)  such  that  po,  pi  and  p2  are 
collinear  (see  Figure  3).  Assume  furthermore  that  pi  lies  in  between  po  and 
P2;  otherwise,  Fi  is  not  smooth  and  does  not  satisfy  the  constraint  on  the 
curvature.  Since  p2  is  the  midpoint  of  pi^i,  it  follows  that  po,  pi,  P2  and  qi 
are,  in  this  order,  on  the  line  L  passing  through  po  and  directed  by  u  (the 
line  is  necessarily  directed  by  u  because  pi  ^  po  belongs  to  that  line).  With 
u  X  V  ^  0,  q2  does  not  belong  to  L,  Thus,  for  a  <  d,  the  triangle  P0P1P2  is 
not  flat  but  tends  to  a  flat  triangle,  with  flat  vertex  at  pi,  as  a  tends  to  a. 
Therefore,  when  a  tends  from  below  to  d,  Fi  tends  to  a  straight  line  segment, 
and  the  maximum  curvature  of  Fi  tends  to  0.  Thus,  there  exists  a  <  d  such 
that  the  maximum  curvature  of  Fi  is  smaller  than  1 .  It  follows  that  d  is  bigger 
than  the  smallest  solution  a  we  found  previously.  Therefore,  when  u  x  u  /  0, 
there  is  always  an  optimal  solution  with  po,  pi  and  p2  not  all  collinear. 

We  now  show  that,  when  u  x  u  =  0,  there  may  not  exist  a  solution. 
Assume  for  example  that  poqi  is  not  parallel  to  u  and  u,  and  that  u  +  u  =  0. 
Then,  when  a  tends  to  0,  /co(Fi),  ki(Fi)  and  Kmaxi'^i)  tend  respectively  to 
+00,  0  and  +00.  Similarly,  when  a  tends  to  +00,  they  tend  respectively  to 
0,  4-00  and  4-oo.  It  follows  that  max(Ko(Fi), ki(Fi))  and  «:maT(Fi)  tend  to 
-foo  when  a  tends  to  0  and  4-oo.  In  addition,  ko(Fi),  ki(Fi)  and  ^max  (r'l ) 
are  never  equal  to  0  because  then  ]|pim||  =  0  or  .A(poPip2)  =  0  which  would 
imply  that  po,  pi  and  p2  are  collinear,  which  is  impossible  since  the  two 
rays  starting  at  po  and  52  with  direction  u  and  —v  do  not  intersect.  Thus, 
max(Ko(Fi),  ki(Fi))  and  Kmax(Fi)  are  strictly  greater  than  a  positive  constant 
for  any  o;  G  (0,+oo),  and,  by  scaling  the  plane,  this  constant  can  be  scaled  to 
a  value  greater  than  1. 


Fig.  4.  Example  where  ’If  >  0  for  any  a  >  0  (pod  x  u  >  0,  pod  x  u  >  0  and 
n  X  n  <  0). 
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3.3.  Curvature  control  problem  with  continuity 

We  consider  here  the  following  variant  of  the  curvature  control  problem; 

Given  two  points  M  and  N,  and  two  unit  vectors  u  and  v,  we  want 
to  find  two  quadratic  Bezier  curves  Fi  and  T2  joined  with  continuity 
(atp2  =  q^),  interpolating  the  two  points  M  and  N  (atpQ  and  q2 
respectively),  such  that  the  tangent  vectors  at  M  and  N  have  direc¬ 
tions  u  and  V  respectively,  the  maximum  curvature  of  the  two  curves 
is  smaller  or  equal  to  1,  the  distances  a  —  ||j>oPi||  0,'^d  /?  =  11^1^2  !l 
are  equal,  and  such  that  a  is  minimized. 

As  we  said  in  Section  3.1,  the  problem  is  equivalent  to  finding  the  smallest 
ot  G  (0, +00)  such  that  Fi  and  F2  are  connected  and  their  curvature  is 
everywhere  smaller  or  equal  to  1,  where  po  =  M,  52  =  Pi  =  Po  +  olu, 
qi  =  q2  —  OiVj  and  there  exists  p  G  (0, 1)  such  that  p2  =  =  PPi  +  (1  —  p)9i- 

The  curves  Fi  and  F2  are  connected  G^  if  and  only  if  the  two  signed 
curvatures  of  Fi  and  F2  at  p2  are  equal,  that  is,  by  Theorem  1, 

P^  X  pfp2  ^  qm  X  qf^ 

2ibiP2||3  "  2||g„«i||3  ’ 

when  the  triplets  of  points  (po,Pi,P2)  and  {qo^Qi,q2)  are  not  collinear.  We 
easily  get  that  pfp2  =  (1  -  p)pi^,  qm  =  PPT^ ,  pT^  -  Po^  -  Ci{u  +  v), 
Popi  =  au  and  qiq^  =  av.  Thus,  we  get  that  F  is  G^  if  and  only  if 

an  X  (1  -  p)(poq2  -  a(u  +  v))  ___  li(poq2  -  o((u  +  v))  x  av 
(l-/z)3||pigi||3  /i^llpigilP 


u  X  Poq2  —  au  X  v  poq2  XV  -  auxv 


{i~pY 


—  2/i^  +  =  0  where  ^  ^ — au  x  v  ^  ^  ^ 

Poq2  x{u-\-v) 

Standard  calculations  yield  that  the  equation  —  2p^  -f  ^  =  0  admits 
a  root  in  (0,1)  if  and  only  if  G  (—1/3,0).  We  can  easily  choose  pQ,q2,u 
and  V  such  that  ^  0  (—1/3,0).  Indeed  (see  Figure  4),  ^  >  0  for  any  u,  v 
that  are  on  the  same  side  of  po^2  (^-e.,  poq2  x  u  and  po^  x  v  have  the  same 
sign)  and  such  that  v  lies  in  the  small  wedge  defined  by  poqi  and  u  {i.e.,  uxv 
and  poq2  x  v  have  opposite  signs).  We  thus  proved  that  there  is  no  solution 
to  our  curvature  control  problem  for  a  set  of  non-degenerate  choices  of  the 
parameters  M,  N,  u  and  v. 

However,  when  a  solution  exists,  it  can  be  computed  as  in  the  previous 
section.  Indeed,  the  curvature  Kmax(rt)  be  expressed  as  a  ratio  of  poly¬ 
nomials  in  a  and  p,  and  the  inequality  «:Tnaa:(r’i)  <  1  reduces  to  a  polynomial 
inequality  of  degree  28  in  a.  Similar  remarks  hold  for  «:o(ri)  and  Ki(Fi). 
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§4.  Concluding  Remarks 

It  remains  open  to  solve  the  curvature  control  problem  when  the  length  of  the 
curve  is  to  be  minimized.  Another  interesting  approach  would  be  to  determine 
how  much  longer  than  optimal  our  curves  are.  Also,  we  would  like  to  consider 
the  case  when  the  data  consist  of  more  than  two  control  points.  Note  also 
that,  because  of  the  high  degree  of  the  equations,  it  is  not  clear  that  the 
solutions  presented  in  Sections  3.2  and  3.3  are  usable  in  an  interactive  curve 
design  context.  This  should  be  tested  with  an  implementation. 
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A  B-spline  Tensor  for  Vectorial 
Quasi-Interpolant 


Fabrice  Dodu 


Abstract.  The  aim  of  this  paper  is  to  introduce  new  techniques  and 
new  tools  for  vector  field  approximation.  We  do  so  by  building  the  equiva¬ 
lent  of  5-splines,  which  are  now  tensor  B-splines,  as  shown  below,  and  by 
appling  to  the  discretization  based  on  a  regular  grid  of  a  differential  op¬ 
erator  a  fundamental  solution  of  it,  as  done  for  polynomial  5-splines  and 
polyharmonic  5-splines  (see  [5,6]),  We  thus  obtain  quasi-interpolants  in 
the  vectorial  case  whose  properties  generalize  the  properties  of  the  quasi- 
interpolants  generated  by  using  5-splines.  All  this  is  done  in  the  case 
when  the  data  lie  on  a  regular  infinite  grid. 


§1.  Introduction 

Fluid  mechanics,  meteorology  and  more  and  more  other  applications  need 
approximate  functions  from  to  Given  some  discrete  vectorial  data, 
we  want  to  get  a  function  interpolating  or  approximating  the  data.  At  first 
glance,  we  may  think  of  doing  this  with  three  independent  approximations 
of  the  data  (one  for  each  component  of  the  data).  Of  course  this  can  be 
done,  but  it  usually  gives  poor  results  since  there  is  no  connection  between 
the  various  components  of  the  approximation  function,  while  the  applications 
may  require,  for  example,  a  divergence-free  (or  a  rotational-free)  function.  In 
order  to  take  into  account  this  kind  of  connection,  we  want  to  determine  the 
function  interpolating  the  data  and  minimizing  a  seminorm  over  all  vectorial 
functions  interpolating  the  data.  The  seminorm  which  is  minimized  is  based  on 
the  Helmholtz  decomposition  of  vector  fields  into  a  rotational  and  a  gradient 
part  (p  ||div-  ||rot-  IP).  This  will  be  presented  in  detail  in  the  forthcoming 
thesis  [2].  The  weight  p  is  introduced  to  allow  the  rotational  part  of  the  field 
to  dominate  the  gradient  part  and  so  in  the  so  obtained  function  [2]. 

In  order  to  determine  the  interpolating  vector,  we  need  to  solve  a  linear 
system  which  is  usually  large  (three  times  the  number  of  data)  and  badly 
conditioned.  This  is  why  in  this  paper  we  do  not  propose  to  interpolate, 
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but  instead  approximate  the  data  by  building  a  quasi-interpolant  based  on 
5-splines.  Note  that  if  are  vectors  and  if  we  want  to  get  a  vectorial 

function  S  such  that  S{x)  =  ^i{^)  we  do  in  the  scalar  case  with  B- 

splines  5^),  we  need  that  the  functions  Bi  are  matrix  and  not  scalar  functions 
as  in  the  case  of  scalar  approximation  in  R,  or  IR,^^.  This  is  why  the  tools  we 
build  are  tensors. 

Now,  in  order  to  build  this  “5-spline  Tensor” ,  we  will  use  the  same  strat¬ 
egy  as  for  polynomial  and  polyharmonic  5-splines:  we  will  discretize  the 
differential  operator  Pjn,p  (5)  defined  in  [2]  and  apply  to  this  discretization  a 
fundamental  solution  of  Pjn,p  (D),  thus  obtaining  a  kind  of  approximation  of 
the  Dirac  tensor  6 13.  All  this  is  done  on  a  cardinal  grid  (i.e.  a  regular  infinite 
grid). 

For  our  three-dimensional  problem  [2],  we  choose  the  3x3  differential 
matrix  defined  by 

Pm,p  {D)  -  (-1)"^  A— 1  [pVdiv  .  -rot  (rot.)] , 

where  div  is  the  divergence  operator,  V  is  the  gradient  operator,  rot  is  the 
rotational  operator  and  p  is  an  arbitrary  positive  parameter. 

We  remark  that  if  p  =  1,  then  (5)  =  (—1)’^  A”^  I3,  where  1 3  is  the 
identity  matrix  of  R.^,  so  we  obtain  three  independent  operators  (one  for  each 
component),  each  one  being  as  in  ([5]). 

In  the  first  part  of  this  paper  we  give  the  construction  of  a  discretization 
of  Pm,p  [D).  In  the  second  part,  we  define  polyharmonic  5-spline  tensors  and 
give  the  main  properties.  In  the  last  part,  we  study  the  associated  vector 
quasi- interpolants,  and  in  particular  prove  their  Pit-reproduction. 


Notation:  Let  m  be  a  integer  with  m  >  2.  P„  denotes  the  set  of  polynomial 
with  variable  in  R^  of  total  degree  at  most  n  and  =  Pn  x  IPn  x  IPn*  D 
denotes  the  set  of  distribution  of  R^  and  V  —  D  xD  xD  .  Let  /  :  R^  1 — >  R 
be  a  scalar  function  of  a  three  dimensional  variable.  Let 

/  ^2/3  -  ^3/2  \ 
rot/  =  ^3/1  -difs 

\dJ2-d2hJ 


be  the  rotational  operator,  and  let  div/  =  <9i/i  +  52/2-h^3/3  be  the  divergence 
operator.  Let  A’^  =  .  T  (g)  is  the  Fourier  transform  of  the 

function  (or  distribution)  g.  For  all  C  G  R^,  sin  (C)  denotes  the  vector  defined 
by  VI  <  2  <  3,  sin(()j  =  sin(Ci).  S  denotes  the  Dirac  distribution.  Let 

Vm+i  be  the  function  such  that  :  Vm+i  =  All  tensors 

will  be  denoted  with  bold  capital  letters  (i.e.  X,...).  Let  h  >  0,  and  i  be 
such  that  :  =  —1.  (ei)^_j  denotes  the  canonical  basis  of  R^.  For  fc  G  IN 
and  i  =  1,2,  or  3,  denotes  the  A;th  divided  difference  of  step  h  defined 

by  /)  M  +  5«i)  -  /  (^  -  lei).  Sl  i  =  °  ^l,i-  We  use 

standard  multi- index  notations.  If  o;  G  IN^  and  |a|  =  0:1-1-02-1-0:3,  then 


a«2 


{axi)“i  (ax2)“2  (ax3)“3  ’ 


L  1  3  * 
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§2.  Discretization  of  Pm,p(D) 

The  goal  of  this  section  is  to  define  an  approximation  of  Pm,p  (D)  which  should 
reproduce  polynomials  of  largest  possible  degree  (in  the  sense  defined  below). 
We  now  introduce  the  Pfc-exactness  of  an  operator,  and  a  Pfc-exact  approxi¬ 
mation  of  D°‘. 

Definition  1.  An  operator  E'  is  said  to  be  a  Pfc-exact  fresp.  'Pfc-exact)  ap¬ 
proximation  of  an  operator  E  iff  for  any  function  f ,  E'  f  is  a  linear  combination 
of  translates  of  f  and  for  any  p  G  Pfc  (resp.  p  £Vk),  E' p  =  Ep. 

Remark.  is  a  P 2 -exact  approximation  of 

Definition  2  .  Let  Dj^  ^  ^  be  the  operator  defined  by 


p,i  _  (~1) 


{k\f 


fc=0 


2h  (2fe  +  l)! 


For  anj  ol  G  we  deline  an  approximation  of  step  h  and  level  N  of  to  be 
the  operator  defined  by 


^h,N  —  °  (-^^,2,^)  °  {^h,3,N) 


\CX3 


In  the  same  way  Bm,p  (-C^/i,iv)  denotes  the  approximation  of  Pm,p  (D). 

Proposition  3.  Let  o:  G  IN^  and  be  defined  as  above.  Then,  for  all 

mappings  f  from  to  P,  there  exists  real  constants  (c.y)  such  that 

^  (■Dfc.jv/)  =  f  ^  H  )  -^  (/)  ■ 

V  |7|=(2W+l)|a|  / 

Proof:  For  every  1  <  i  <  3, 

P  K,i/)  (C)  =  (exp(27r/iCi)  “  exp  •  P  {f)  (C)  =  2ism{hC)i- P  (f)  (0 

By  applying  the  Fourier’s  transform  to  the  {2N  +  1)  \a\  centered  differences 
of  we  obtain  the  result.  □ 

Proposition  4.  Let  a  G  IN^.  Then 
i)  is  a  P|a|+2N-f-i'exact  approximation  of  D“. 

a)  Pm,p  {Dh,N)  is  an  approximation  of  Pm,p  {D)  which  is  P2m+2N+i-exact. 

Proof:  We  prove  by  induction  on  \a\  that  for  all  /  G  (72'^+2+laj^  there  exist 
c-y  real  constants  such  that 

=  £>“/+  Y. 

|7|=2iV+2+|al 
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In  the  following,  for  any  7  G  IN^,  rf-y,  are  real  constants.  The  proof  for 
\a\  =  1  is  due  to  Steffensen  in  ([7]).  Suppose  that  it  is  true  for  jQ:|  =  m.  Let 
|a|  =  m  +  1,  /  G  (72A^+2+m+i  yg  choose  i  such  that  Oj  ^  0.  Then 

dinI  =  Ki,N  {Dt:N“f) 

\  \'r\==2N+2+m 

|7|=2N+3  |7j=2Ar+3+m 

=  £>“/+  E  «-»/(«■ 

|7|=2N+3+m 

Thus,  we  get  the  result  if  we  note  that  Pm,p  {D)  is  a  differential  operator  of 
degree  2m.  □ 

§3.  J3-spline  Tensors  Associated  with  Pm,p(D) 

Lemma  5.  Let  Vm+i  be  such  that  (—1)’^'^^  =  S  in  D  .  Let 

X  =  tir 

P 

/  (^2,2  +  ^3,3)  ^m+l  ~^l,2'^m4-l  ~^l,3'^Tn+l  \ 

+  (  —  1)”^  I  —^l,2^Tn-\-\  ipl,!  +  ^3,3)  ^m+1  ~^2,3'^rn  +  l  1  • 

\  -dl^Vm+l  -^2,3^m+l  (^1,1  +  ^2.2)  ^m+l  / 

Then  Pm, p  {D)X  =  6h. 

Definition  6.  Let  X  be  a  fundamental  solution  tensor  ofPm,p  (P)  X  = 

We  define  the  level  N  and  step  h  B-spline  tensor  associated  with  the  operator 
Pm,p  [D),  to  be  the  tensor  defined  by  Pm,p  {Dh,N)  X. 

Remarks. 

a)  B'^’^  is  not  a  symmetrical  tensor. 

b)  If  m  is  even,  we  can  prove  the  existence  of  a  differential  matrix  R^,p  {D) 

of  degree  m  such  that  :  Pm,p[D)  —  Rrn  p{D)  and  we  can 

construct  a  symmetrical  tensor  p  deBned  by 


C 


m,p 

h,N 


=  h^R^^p{Di„N)  XR^,„{Dh.N)- 


Lemma  7.  The  elements 
tributions  on  IR^. 


of 


l<t,j,<3 


are  in  the  set  of  tempered  dis- 
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Denoting  by  T  j  the  Fourier  transform  applied  to  each  element  of 
the  tensor  we  obtain  the  following  theorem: 

Theorem  8.  With  the  above  notation, 

i)  = 

ii)  V7  e  JN^  |7|  <  +  1  ;  (b”;?  (0))  =  0, 

iii)  VI  <j,l<  3,  (b"^)  ,  (t)  =  O 

\  ^3,1  |i|->+oo 

iv)  Vfc  €  Til,  V7  e  ]N^  |7|  <  2m  -  1  ;  (b™^)  (i)  =  0, 

V)  = 


Proof;  For  i)-ii),  we  use  Definition  2  and  Proposition  3.  By  the  Taylor 
expansion  of  sin,  near  0  we  obtain  T  (0  =  ^  '-^3  +  Q  (0?  where  Q  (C) 

is  such  that  for  all  i  and  j  such  that  1  <  j  <  3,  there  exist  real  coefficients 
such  that  ,  (C)  =  E|a|>2w+2m+4‘=’ei^pr<|I^-  ^hus,  we  obtain  the 
results. 

We  prove  iii).  By  using  the  above  expression  of  Q,  D^T 


integrable  near  0  if  and  only  if  2iV  +  2  —  I7I  >  —3.  Using  Proposition  3 
and  Lemma  5,  there  exist  real  coefficients  and  for  all  %  and  j  such  that 


fc=l  |al=2m(2iV+l) 


As  a  consequence,  the  elements  of  T  are  bounded  at  infinity  by  ratio¬ 
nal  fractions  of  degree  -2m.  ^  are  integrable  near  infinity  if  and 

only  if  -2m  -  I7I  <  -3.  Then,  ^  integrable  in  iff  2N  -j- 

2  -  |7|  >  -3  and  -2m  -  I7I  <  -3.  Thus,  =  o  {\t\-^^-^) , 

for  all  1  <  i,  /  <  3.  Using  [4],  the  last  expression  may  be  strengthened  to 

\  ’  /  |t|-t+CXD 


We  now  establish  iv).  For  all  j  in  2Z^,  7  in  IN^  and  positive  h,  we  have 
D->T  (b™;?)  (i)  =  0  iff  WT{P„,p  {Dk,N))  (0)  =  0.  Now  by  using  Proposi- 
tion  3,  we  have,  for  all  tensors  Y  and  all  7  in  IN^  such  that  I7I  <  2N  +  2m  -|- 1: 


D''  {T{Pn..p  {Dk.n)  i"))  (0)  =  (jP(P„,p  (B) )  (0)  -TiY)  (0) ). 
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Furthermore,  F{Pm,p(D))  is  a  polynomial  matrix  of  degree  2m  so  V|7|  < 
2m  -  1  ,  {Pm,p  {D))  (0)  =  0.  This  gives  iv). 

Finally,  we  prove  v).  According  to  the  Fourier  transform’s  properties,  v) 
is  equivalent  to 

^  (0  =  ^ (ft))  iO  =  ^  {Kn)  l'*  •  0 .  VC  e 

Now,  from  the  definition  of  we  derive  for  any  C  in  IR^, 

/■  (b™;;)  {h-0  =  T  (£>1,^,)  x)  (h  ■  C) 

=  ){h-o-r  (p„,p) 

Using  Proposition  3,  we  have 

(Di,n))  {h-o  =  />'’"P(P^,,  {Dh.n))  (C)  , 

P(P„,^  (P))-'  (ft  •  C)  =  ft-'">P{Pm,,  (D))-'  (C)  , 

and  thus  VC  G  IR^, 

T  (b™J;)  (ft  .  C)  =  P(Pm,p  (Dh.N))  (0  ■P{Pm.p  (D))-'  (C) 

=  ft-^.p(B;:;;;)(C).  □ 


Remarks. 

a)  These  polyharmonic  B-spline  tensors  may  be  considered  as  a  regularisa- 
tion  of  the  Dirac  distribution  tensor  SI3. 

b)  We  obtain  the  same  properties  with  C^’^. 


§4.  Associated  Vector  Quasi-Interpolant 

Given  vectorial  data  in  this  section  we  define  a  vector  field  S  ap¬ 
proximating  the  data  (ih,  (i.e.  such  that  S{jh)  ~  Zj  for  all  j  in 

7Z^),  by  using  the  above  defined  tensor  R-splines.  This  vector  generalizes  the 
polyharmonic  R-spline  quasi- interpolant  (see  [5,6]). 

Definition  9.  Let  be  the  level  N  and  step  h  B-spline  tensor  associated 
with  Pm,p(D).  For  all  j  G  71^,  let  zj  G  and  let  z  =  (zj).^r^3-  Then 
the  vector  quasi-interpolant  of  step  h  and  level  N  associated  with  the  operator 
Pm,p  {D)  and  the  (jh,  data,  is  the  vector  function  defined  by 

^r,N-,z  =  X/  (■  “ 

jeTZ^ 


B-spline  Tensors  for  Vectorial  Quasi-interpolant 


207 


Theorem  10.  Let  I  =  inf  {2N  +  1, 2m  -  1},  and  suppose  there  exists  p  ^Vi 
such  that  for  all  j  in  7L^ ,  Zj  =  p(jh).  Then,  —  p.  We  say  that  the 

vector  quasi-interpolant  of  step  h  and  level  N  reproduces  Vi-  As  a  particular 
case,  reproduces  7^2711-1  • 

Proof;  The  proof  is  based  mainly  on  Poisson’s  egality  and  Theorem  8.  It 
follows  along  the  same  lines  as  the  proof  in  [3]  in  the  scalar  case.  □ 

Remarks. 

a)  2m  ~  1  is  the  maximal  order  of  reproduction. 

b)  We  obtain  the  same  properties  if  we  dehne  the  vector  quasi-interpolant 
using  by  the  symmetrical  tensor 

c)  A  similar  problem  is  studied  in  IR^  in  ([1]),  where  using  another  discretiza¬ 
tion  of  P2,p  {D),  the  authors  obtain  a  vector  quasi-interpolant  which  is 
(Fi  (F^)) ^-reproducing. 

Theorem  11.  Let  f  be  a  vector  function  of  C  (F^)* -class  and  all  partial 
derivatives  of  order  k  being  bounded  over  F^.  Let  /  above  defined 

vector  quasi-interpolant  associated  to  the  (jh,  f  {jh)^  data.  Then 


SUPII^S^]^;;  W-/WII  = 


O  (/i*’) 


ifk  <2N  +  l, 


[0(/l2^+2|in(/i)|)  ifk>2N-{-2. 


Proof:  The  proof  follows  that  of  Theorems  4.11,  5.1  and  5.6  in  [3].  □ 
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Analysis  of  Scalar  Datasets  on 
Multi-Resolution  Geometric  Models 


Alexandre  Gerussi  and  Georges-Pierre  Bonneau 

Abstract.  Recently,  multi-resolution  methods  based  on  non-nested 
spaces  were  introduced  to  allow  the  visualization  and  approximation  of 
functions  defined  on  irregular  triangulations  [3,4,5].  This  paper  comes  back 
to  these  methods  and  shows  more  precisely  how  the  subdivision/prediction 
/correction  scheme  of  ordinary  wavelet-based  multi-resolution  analysis 
(MRA)is  also  present  in  that  framework.  As  an  illustration,  it  is  demon¬ 
strated  how  it  can  be  applied  in  two  of  the  classical  issues  of  MRA:  com¬ 
pression  and  level-of-detail  editing.  We  also  show  that  the  framework 
can  be  used  for  the  analysis  and  approximation  of  scalar  data  defined  on 
meshes  with  arbitrary  topology,  thus  extending  our  previous  results  in  the 
plane  and  the  sphere.  Here  again,  the  link  with  the  corresponding  classical 
multi-resolution  scheme  of  [6]  as  well  as  decimation  methods  is  made. 


§1.  Introduction 

In  the  last  few  years,  the  problem  of  simplifying  huge  3D  triangular  meshes, 
for  the  purpose  of  e.g.,  visualization,  transmission  or  storage,  has  received 
considerable  attention.  Among  those  works,  two  major  approaches  can  be 
found.  In  the  case  of  regular  meshes,  the  use  of  a  wavelet-based  framework 
has  proven  to  be  a  powerful  solution  [6,9,14],  On  the  other  hand,  when  meshes 
are  not  regular,  the  approach  has  been  to  simplify  the  mesh  by  applying 
a  sequence  of  elementary  geometric  simplification  operations,  such  as  vertex 
removals,  edge  collapses  or  triangle  collapses,  the  order  of  removal  being  driven 
by  a  greedy  algorithm  [1,8,11].  We  refer  to  this  latter  approach  as  a  decimation 
approach. 

This  paper  is  concerned  with  the  simplification  of  data  that  is  defined 
on  a  surface  by  means  of  a  triangulation.  This  topic  is  closely  related  to 
surface  simplification,  since  a  triangular  mesh  can  be  seen,  at  least  locally,  as 
the  graph  of  a  piecewise  linear  function  supported  by  a  triangulation.  Here 
again,  when  the  surface  is  well-known,  for  example  a  plane  square,  a  sphere, 
a  cylinder,  etc.,  several  wavelet  based  approaches  have  been  employed  [9,14], 
with  regular  underlying  meshes. 
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In  [3,4,5]  the  concept  of  non-nested  MRA  was  introduced  and  applied 
to  the  approximation  and  progressive  visualization  of  piecewise  constant  or 
linear  functions  defined  on  arbitrary  planar  or  spherical  meshes.  In  Section  2, 
we  investigate  the  relationship  between  the  non-nested  framework  and  irreg¬ 
ular  subdivision.  Examples  of  compression  and  level-of-detail  editing  in  that 
framework  are  given  in  Section  3. 

Section  4  focuses  on  the  approximation  and  visualization  of  functions 
defined  on  triangular  meshes  with  arbitrary  topology.  Here,  an  additional 
difficulty  is  that  the  surface  which  supports  the  function  is  also  altered  by  the 
approximation  process.  Like  in  the  case  of  surface  simplification,  a  wavelet 
approach  can  successfully  be  applied  when  the  original  mesh  has  subdivision 
connectivity  [6,12].  Otherwise,  in  an  irregular  setting,  a  decimation  approach 
is  usually  employed,  and  the  function  is  approximated  during  the  simplifica¬ 
tion  process  [1,11].  We  show  how  to  apply  our  framework  to  functions  defined 
on  such  general  meshes.  We  will  see  that  it  makes  the  link  between  the 
wavelet-based  approach  available  for  subdivision  surfaces  and  the  decimation 
model.  As  in  our  previous  papers,  the  function  in  its  multi-resolution  form 
is  described  by  a  coarse  approximation  defined  on  the  simplified  mesh,  and 
a  sequence  of  detail  coefficients  that  are  used  for  the  reconstruction  of  the 
function  on  every  LOD  up  to  the  original  mesh.  Our  scheme  is  fully  bijective: 
The  function  multi-resolution  representation  has  the  same  size  as  the  original 
one.  The  approximation  process  performs  L2  approximation  of  the  data,  but 
other  types  of  approximation  are  also  possible. 

§2.  Non-Nested  Framework  and  Irregular  Subdivision 

In  Section  2.1  we  briefly  review  the  non-nested  decomposition  scheme  de¬ 
scribed  in  [3,5].  Section  2.2  makes  the  link  with  the  notion  of  irregular  subdi¬ 
vision. 

2.1.  Decomposition  scheme 

For  simplicity,  every  space  is  supposed  to  have  finite  dimension.  Let  (7  be  a 
measurable  domain  and  V'',  i  =  0, . . . ,  A^,  a  sequence  of  subspaces  of  X2(n). 
These  spaces  do  not  have  to  be  nested  but  will  in  general  be  “growing”  in  the 
sense  that  dim(V’*)  <  dim(V'*'^^).  Now  let  /  =  /yv  be  a  function  in  the  finest 
subspace  .  In  classical  MRA,  the  spaces  are  nested  and  the  link  between 
and  is  made  by  taking  a  complementary  space  of  in 

that  is 

=  v^-^ew^-\ 

Now  if  we  write  /^r  =  /n-i  +  9n~i  according  to  the  space  decomposition, 
/tv-i  can  be  seen  as  an  approximation  of  /at  in  and  qm-i  as  the  detail 

needed  to  recover  the  original  function  from  its  approximation.  By  repeating 
the  decomposition,  one  obtains 

fN  =  fo+9o-\ - 1-  9N-1 

which  corresponds  to  the  space  decomposition  =  1/^0  0^“^ 


Analysis  of  Scalar  Datasets 


211 


Notice  that  in  this  case  fi  =  Pvi{fi+i),  where  Pyi  is  the  projector  on  V*  with 
direction  W\  We  return  to  the  general  case  and  suppose  that  a  linear  “pro¬ 
jector”  P*  :  ->  y*  is  given.  To  avoid  technical  details,  these  projectors 

are  required  to  be  surjective,  but  the  results  in  this  paragraph  also  hold  if 
this  is  not  the  case.  Let  be  the  kernel  of  P\  and  be  a  complementary 
space  of  in  We  now  observe  that  the  restriction  of  P*  to  V'’  is  a 

bijective  operator,  having  the  same  range  as  P\  Thus  if  fi  =  P*(/i+i) 

Qi  Q^{fi+i),  where  is  the  projector  on  (defined  by  the  choice  of  y*), 
the  following  reconstruction  formula  holds; 

/i+i=Inv(P^,)(/i)  +  Si-  (1) 

Again,  by  iterating  this  decomposition,  we  obtain  a  coarse  approximation 
/o  =  P°  o  •  •  •  o  P^~^{fN)  and  “detail”  functions  go,...,  9n-i- 

We  now  take  a  look  at  the  reconstruction  process.  Denote  by  the 
inverse  of  P^^^,  :V^  C  K+i.  The  complete  coarse-to-fine  reconstruc¬ 

tion  formula  is  obtained  by  iterating  the  reconstruction  formula  (1): 


2.2.  Approximating  spaces,  subdivision  spaces  and  scaling  spaces 

We  are  going  to  see  that  the  previous  scheme  can  actually  be  considered  in 
two  different  ways.  Until  now,  it  was  implicitly  assumed  that  the  spaces 
y^  were  playing  the  role  of  the  scaling  spaces  in  classical  MRA.  Under  this 
assumption,  we  conceptually  have  a  really  non-nested  framework;  if  the  spaces 
were  nested,  the  operators  would  be  the  identity  (formally  injecting  in 
y*+^).  However,  we  will  not  call  them  scaling  spaces  but  approximation  spaces, 
and  keep  the  term  “scaling”  for  other  spaces  that  are  going  to  be  defined  below. 

In  the  non-nested  framework,  one  loses  the  notion  of  subdivision  (or  cas¬ 
cade  algorithm).  However,  looking  at  things  slightly  differently  allows  subdi¬ 
vision  to  fit  into  the  non-nested  scheme.  To  show  this,  consider  the  operators 
S'*  as  subdivision  operators,  and  call  the  spaces  y*  subdivision  spaces  accord¬ 
ingly.  This  means  that  we  start  from  a  function  /o  E  y*^  and  iteratively 
subdivide  it  into  fi,  /2 ■ /v  using  the  formula  fi^i  —  S*(/i).  The  notion 
of  subdivision  used  here  is  very  general.  In  that  context,  classical  regular 
or  semi-regular  subdivision  schemes  would  give  rise  to  a  nested  sequence  of 
subdivision  spaces.  But  completely  irregular  schemes  would  require  the  non¬ 
nested  framework  to  be  fitted  in.  The  use  of  non-nested  MRA  was  introduced 
in  [2],  and  was  later  applied  to  triangular  schemes  in  [3,4,5].  Recently,  another 
approach  on  general  irregular  schemes  was  proposed  in  the  work  of  Sweldens 
and  Guskov  [10,13]. 

We  now  define  the  scaling  spaces.  Like  in  classical  MRA,  they  are  the 
spaces  containing  the  limit  functions  resulting  from  the  subdivision  process. 
Since  very  little  is  known  about  the  convergence  of  such  schemes,  we  won’t 
push  the  subdivision  to  infinity,  but  restrict  ourselves  to  an  integer  N.  This 
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makes  sense  since,  when  using  such  a  framework,  one  usually  starts  from  an 
initial  triangulation  which  is  coarsened  to  Tq.  This  contrasts  with  the 
traditional  approach  in  subdivision  where  one  starts  from  a  base  mesh  and 
subdivides  it  according  to  a  systematic  rule. 

The  scaling  spaces,  for  z  =  0, . . . ,  iV,  are  defined  as 

yi,N  ^  yl.N  I  g 

where  and  =  5^”'  o  ■■  ■  o  S'.  This  operator  carries 

functions  in  V*  through  N  —  i  subdivision  steps  to  functions  in  . 

Notice  that,  for  every  z,  1/*’^  C  and  moreover,  that  C 
Now  fix  a  basis  (</?*)  for  and  (V'*)  for  W\  If  (a|)  and  (6|)  denote  the  coor¬ 
dinates  of  fi  £  F*  and  pj  £  with  respect  to  these  basis,  the  reconstruction 
formula  can  be  re-written  as 


k  i=j  k 

for  any  j  =  0, . . . ,  N,  which  corresponds  to  the  decomposition  of  /jv  in 
YN,n  _  prN  fusing  the  same  notations  as  above: 

Although  formally  identical,  considering  the  F*’s  as  approximation  or 
subdivision  spaces  changes  the  aspect  of  several  questions.  For  example,  in 
the  problem  of  error  measure  in  the  context  of  approximation  spaces,  we 
are  interested  in  H/at  —  /i||,  whereas  in  the  other  context  we  are  looking  for 

§3.  Application  to  Data  Compression  and  LOD  Editing 

In  this  section  we  show  how  the  framework  can  be  used  on  functions  defined 
over  irregular  triangulations  to  achieve  data  compression  and  LOD  editing, 
which  are  both  standard  applications  of  MRA.  The  context  here  is  the  planar 
or  spherical  setting  of  [4,5],  only  the  filters  need  to  be  changed.  Indeed,  the 
analysis  operator  used  in  those  papers  was  the  orthogonal  projector  (the  goal 
being  progressive  visualization).  However,  for  compression  it  is  often  useful  to 
know  in  advance  the  error  between  the  original  function  and  its  approximation, 
in  terms  of  the  wavelet  coefficients  that  were  used  in  the  reconstruction.  This 
will  be  achieved  by  designing  new  filters. 

3.1.  Isometric  subdivision 

In  order  for  the  detail  coefficients  to  have  the  error  measure  property,  the 
synthesis  operator  is  required  to  be  an  isometry.  Indeed,  suppose  that 

•  5*  :  F*  ^  F*"^^  is  an  isometric  operator,  Vz  =  0, . . . ,  A  —  1, 

•  the  complementary  spaces  F*  are  chosen  orthogonal  to  the  IF*,  Vi  — 
0,...,iV-l. 
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When  the  latter  condition  is  fulfilled,  we  say  that  we  are  in  a  semi-orthogonal 
framework.  If  /at  denotes  the  original  function,  then,  according  to  the  global 
reconstruction  formula  (2),  the  quantity  \\fN  —  /o’^lP  which  measures  the 
contribution  of  the  correction  steps  in  the  reconstruction  process  is 

o  •  •  •  o  S^{go)  -I - h  S^~^(gN~2)  + 

Because  of  semi-orthogonality,  it  is  equal  to 

O  .  •  •  O  S^{go)  +  •  •  •  + 

and  because  is  an  isometric  operator,  we  can  factorize  and  remove  it 

from  the  first  term  above,  and  then  iterate  the  operation  to  get 

i-O 

Notice  that  even  if  the  subdivision  operators  are  not  isometric, 

II/n  -  •  •  •  ll^'fllsof  +  ■  •  •  +  l|S"-^ini9N-2||"  +  lisjv-if 

still  holds  in  the  semi-orthogonal  setting.  Let  fN  denote  the  partially  re¬ 
constructed  function,  and  suppose  in  addition  that  we  axe  in  an  orthonorma! 
framework,  that  is,  the  functions  -0*  form  an  orthonormal  basis  of  W\  Pro¬ 
ceeding  as  above  leads  to 


ii/n  -  /jvip  =  E 

i=0  k 

where  the  6j.’s  are  the  wavelet  coefficients  and  e\  equals  1  whenever  6J.  is  taken 
in  the  reconstruction  and  0  otherwise.  Consequently,  in  this  setting  we  have 
an  error  measure  in  terms  of  wavelet  coefficients. 

3.2.  An  isometric  subdivision  operator 

The  idea  behind  this  construction  is  that  the  corresponding  analysis  oper¬ 
ator  should  have  reasonable  approximation  quality,  which  seems  intuitively 
required  to  achieve  compression.  Accordingly,  the  projection  operator  used 
in  [4,5]  is  taken  as  a  starting  point,  the  problem  being  to  approximate  it  by 
means  of  an  isometry.  Let  P*  :  ^  be  that  operator.  The  first  step 

is  to  find  the  matrix  of  P*  with  respect  to  some  orthonormal  basis  and 

(e*)  of  and  V^.  Now  let  UDV  be  the  singular  value  decomposition  of 
that  matrix.  This  decomposition  admits  the  following  interpretation: 

•  V  is  the  matrix  of  an  isometric  operator  of  in  the  basis  since 

it  is  unitary  and  the  basis  is  orthonormal. 
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•  D  is  a  ^  X  c  matrix  [t  <  c)  whose  diagonal  coefficients  are  all  non- negative 
and  <  1.  The  diagonal  matrix  formed  by  the  positive  entries  of  D  is  the 
matrix  of  a  bijective  operator  mapping  V'’  onto  Ran{P*). 

•  Like  V",  the  matrix  U  is  an  isometric  operator  of  V^. 

Let  D  be  the  matrix  obtained  by  replacing  every  positive  diagonal  element  of 

by  1.  This  amounts  to  turning  the  bijective  operator  above  into  an  isometric 
one,  and  thus  UDV  is  the  matrix  of  an  operator  P*  whose  corresponding 
subdivision  operator  is  isometric.  A  few  remarks  can  be  made  about  this 
construction: 

•  P*  does  not  depend  on  a  particular  choice  of  orthonormal  basis. 

•  P*  is  not  the  best  isometric  approximation  of  P*  in  terms  of  L2  norm 
of  operators,  but  it  can  be  shown  that  it  is  the  best  with  respect  to  the 
Frobenius  norm. 

•  The  diagonal  coefficient  of  D  are  by  definition  the  cosines  of  the  angles 
between  the  spaces  and  (see,  e.g.^  [7]  Chapter  1).  In  the  nested 
case,  they  would  all  be  equal  to  1,  and  the  corresponding  subdivision 
operator  would  be  the  identity. 

•  This  method  could  be  used  to  approximate  operators  by  means  of  simi¬ 
larities,  by  replacing  the  entries  of  D  by  an  appropriate  scalar  a  instead 
of  1.  Although  better  in  terms  of  approximation  quality,  this  leads  to  bad 
visual  results  since  the  resulting  subdivision  operator  doesn’t  reproduce 
constants  if  a  ^  1. 

Notice  that  in  the  context  of  [4,5],  this  approximation  is  always  computed 
locally,  leading  to  a  global  algorithm  in  linear  time. 

3.3.  Examples 

As  it  is  mentioned  in  the  beginning  of  this  section,  these  examples  were  created 
using  the  setting  described  in  [3,4,5];  the  reader  is  invited  to  look  there  for 
details.  The  initial  triangulation  is  completely  irregular,  generated  by  random 
vertex  insertion.  In  Figure  1,  a  piecewise  linear  setting  is  used  for  LOD  editing. 
The  function  is  edited  at  a  coarse  resolution,  by  pulling  values  up  (— >  white) 
at  some  vertices,  and  then  adding  detail  coefficients  back.  Figure  2  shows  an 
example  of  data  compression  when  the  approximation  spaces  are  spaces  of 
piecewise  constant  functions  over  triangulations  generated  from  the  original 
one  by  homogeneous  decimation.  This  last  setting  is  the  full  generalization  of 
Haar  wavelets  to  irregular  grids,  as  it  would  lead  to  them  in  the  regular  case. 

§4.  Scalar  Datasets  on  Irregular  Meshes  with  Arbitrary  Topology 

In  this  section  we  describe  how  the  non-nested  framework  can  be  used  to 
handle  scalar  attributes  defined  on  meshes  with  any  topology.  As  a  starting 
point,  we  take  a  multi-resolution  decimation  model,  based  on  the  vertex- 
removal  (VR)  operation  to  simplify  the  geometry.  This  means  we  assume 
that  an  initial  fine  mesh  is  given  along  with  its  associated  sequence  of  VR’s 
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iii.  50  points  edited  iv.  Reconstruction 


Fig.  1.  Picture  design  using  level-of-detail  editing. 


Original:  50000  triangles  Compressed  to  5%,  6%  error 


Fig.  2.  Compression  of  a  piece-wise  constant  function. 

that  can  be  progressively  applied  to  decimate  the  mesh  down  to  a  base  mesh. 
In  addition,  we  suppose  that  the  scalar  attributes  are  defined  by  means  of 
piecewise  constant  and/or  linear  functions  parameterized  on  the  initial  mesh. 
In  what  follows,  PI(V)  denotes  the  polygon  of  influence  of  a  vertex  V;  it  is  the 
polygonal  area  delimited  by  the  1-neighbours  of  V.  In  order  to  apply  the  non¬ 
nested  framework,  approximation  spaces  and  approximation  operators  need 
to  be  defined. 

4.1.  Local  mapping 

Let  M\  i  =  No, . . .  ,N,  denote  the  triangular  mesh  consisting  of  i  vertices  (the 
original  mesh  after  N  —  i  VR  operations).  Let  !F(M^)  be  the  space  of  real¬ 
valued  functions  defined  on  M*,  and  C*  (resp.  O)  be  the  subspace  of  functions 
oiT{M^)  that  are  piecewise  constant  (resp.  linear)  on  each  triangle  of  M*.  We 
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refer  to  these  spaces  as  the  upper  approximation  spaces.  Each  VR  alters  locally 
the  surface,  thus  functions  of  and  T{M^)  are  defined  on  different 

domains.  To  define  an  approximation  problem,  a  common  parameterization 
for  these  functions  is  required.  To  that  end,  we  cissume  that  for  every  VR  of 
vertex  Vj,  a  local  one  to  one  projection  If*  of  Pl(Vt)  onto  a  plane  is  also  known. 
The  reader  can  refer  to  [8]  for  a  study  of  the  existence  and  determination  of 
such  a  projection.  We  use  II*  to  consider  the  change  of  parameterization 
W  :  M*  — ►  as  the  mapping  defined  by  IT*  o  iJ*  =  IT*  over  Pl(Vt)  and  by 

the  identity  outside. 

4.2.  Scaling  spaces  and  data  decomposition 

Let  K  stand  for  C  or  C.  To  a  function  f  £  we  associate  the  function 
and  let  C  be  the  space  of  all  these  functions. 

This  can  be  iterated:  The  local  mappings  also  define  a  global  mapping  from 
the  base  mesh  to  the  mesh  M*  by  o  •  •  *  o  for  each  i  = 

A^o  +  1)  •  •  • )  -V.  This  allows  to  define  the  approximation  spaces  from  the 
upper  approximation  spaces  as 

iC'  =  {fo  I  /  €  r}  C 

The  second  step  is  to  define  the  operators  P'  :  K,'^^  — ♦  K'.  Fortunately, 
working  directly  in  the  approximation  spaces  is  not  required:  Because  they  are 
isomorphic  (by  construction)  to  the  upper  approximation  spaces,  it  suffices  to 
define  some  operators  P*  :  — >  /C*,  and  for  the  purpose  of  visualization, 

the  least-square  projection  operators  will  be  used.  The  operator  P*  is  thus 
defined  by 


p'  ■■  fi+i  =  fi+i  °  ^  o 

/i+i  e  defines  /,'^i ’’  to  which  we  apply  one  decomposition  step  to  get 
an  approximation  /j  G  /C*  and  detail  coefficients  (1  in  the  linear  case,  and  2 
in  the  constant  case).  The  entire  operation  is  then  repeatedly  applied  to  fi, 
fi-iy  etc.  Consequently,  in  practice,  everything  happens  in  the  upper  approx¬ 
imation  spaces  which  have  a  much  simpler  structure  than  the  corresponding 
approximation  spaces. 

4.3.  Results  and  remarks 

The  output  of  this  algorithm  is  a  coarse  function  defined  on  the  base 
mesh  and  a  list  of  detail  coefficients  that  allow  the  exact  reconstruction  of 
the  original  function  through  the  hierarchy  of  LCDs.  Figure  3  shows  some  re¬ 
sults  in  the  linear  setting.  In  these  examples,  the  geometric  criterion  guiding 
the  decimation  priority-queue  is  simply  the  distance  from  the  1-neighbours  of 
a  candidate  vertex  to  their  least-square  approximation  plane.  On  the  upper 
right  snapshot,  we  see  the  drawback  of  a  geometric-only  driven  priority  queue: 
Some  quasi-planar  areas  on  the  object  have  been  severely  decimated,  leading 
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Original:  32000  vertices  Geometric  criterion:  3000  v. 


Geom.  &  data  criterion:  3000  v.  Same  without  vertices 

Fig.  3.  Bracket:  linear  approximation  examples. 

to  a  quite  coarse  approximation  of  the  function.  The  lower  snapshots  show 
the  result  using  the  geometric(|)  and  data(|)  based  criterion.  The  resulting 
approximations  are  better,  but  the  geometry  presents  some  visible  deforma¬ 
tions  (this  is  also  partially  due  to  our  simple  geometric  criterion).  Finding  a 
compromise  in  an  automated  way  seems  to  be  a  difficult  task.  Moreover,  if  the 
simplification  is  just  a  process  prior  to  other  computations,  such  as  can  be  the 
case,  e.g.  in  mechanics,  then  a  high  accuracy  in  the  approximated  function 
might  be  the  primary  interest.  Thus,  it  seems  better  to  let  the  weights  depend 
on  the  application,  under  user  control. 

4.4.  Comparison  to  classical  MRA 

In  [6],  a  MRA  for  subdivision  surfaces  is  used  to  handle  both  the  geometry 
and  the  scalar  attributes  of  a  mesh.  The  presentation  given  above  makes 
the  link  between  these  methods  and  the  decimation  approach.  Indeed,  from 
the  “upper”  point  of  view  —  the  decomposition  using  the  operators  P*  —  it 
compares  to  decimation  in  many  respects,  whereas  it  is  also  a  decimation  step 
corresponding  to  the  approximation  spaces,  which  is  exactly  what  is  done  is 
[6]  in  the  nested  case.  However,  parameterizations  are  then  obtained  without 
a  local  projection  hypothesis,  thanks  to  the  particular  l-to-4  splitting  strategy 
that  is  performed  on  the  base  mesh. 

Acknowledgments.  We  would  like  to  thank  Jean-Claude  Leon  from  the 
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Biorthogonal  Refinable  Spline  Functions 


Tim  N.  T.  Goodman 


Abstract.  We  give  a  construction  for  refinable  spline  functions  of  degree 
n  with  compact  support  and  simple  knots  in  |2Z  which  are  biorthogonal 
to  uniform  B-splines  of  degree  n  with  simple  knots  at  ^2Z. 


§1.  Introduction 

A  function  is  refinable  if  it  is  a  linear  combination  of  dilates  of  integer  translates 
of  itself.  Such  functions  are  central  to  multiresolution  methods,  in  particular 
in  the  construction  of  wavelets.  In  general,  refinable  functions  can  be  defined 
implicitly  from  the  refinement  equation  which  they  satisfy,  but  explicit  con¬ 
structions  of  refinable  functions  are  restricted  mainly  to  spline  functions,  i.e. 
piecewise  polynomials.  If  we  require  the  natural  condition  that  the  integer 
translates  of  the  univariate  refinable  spline  function  with  compact  support 
form  a  Riesz  basis,  then  (j)  can  only  be  a  uniform  B-spline  with  simple  knots 
[4].  However  there  is  more  flexibility  if  we  replace  the  single  function  (f)  by 
a  refinable  vector  of  spline  functions  . . . ,  For  a  survey  on  refinable 
spline  functions,  see  [2], 

In  multiresolution  methods,  orthogonality  plays  an  important  role.  In  [1], 
constructions  are  given  for  refinable  functions  whose  integer  translates  are 
biorthogonal  to  a  given  refinable  function  (p,  in  particular  when  ^  is  a  uniform 
B-spline  with  simple  knots.  However  these  dual  functions  are  not  defined 
explicitly.  We  give,  in  Section  3,  constructions  for  refinable  spline  functions 
of  compact  support  which  are  biorthogonal  to  uniform  B-splines  with  simple 
knots.  This  requires  refinable  vectors  of  three  functions:  the  uniform  B-splines 
have  knots  in  ^7Z,  while  the  dual  functions  have  the  same  degree  and  simple 
knots  in  The  construction  is  based  on  a  general  result  in  Section  2  giving 
necessary  and  sufficient  conditions  for  biorthogonality  of  certain  vectors  of 
(not  necessarily  refinable)  functions  in  terms  of  a  Grammian  matrix. 
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§2.  Biorthogonal  Basic  Sets 

Let  (^i,...,0r  be  compactly  supported  real- valued  functions  in  L^(]R).  We 
say  {<^1, . . .  is  a  basic  set  for  a  space  V  \iV  comprises  all  real,  finite, 
linear  combinations  of  integer  translates  of  ^i, . . . , 

We  say  basic  sets  and  {V'l, . . .  jV’r}  are  biorthogonal  (or  the 

basic  set  {V'l,  •  •  •  ,^r}  is  dual  to  {0i, . . .  if 

/  keTL. 

J  —  oo 

A  basic  set  {^i, . . . ,  <f>r}  is  said  to  be  stable  if  {4>i[-  —  j)  :  i  =  1, . . . ,  r,  j  G 
7L}  forms  a  Riesz  basis,  i.e.  for  some  A,  B  >  0, 

r  oo  «oo  r  oo  r  oo 

E  4^/  EE  <bE  E  4 

J  =  1  j  =  —  oo  °°  1  =  1  j  =  — oo  1  =  1  J  =  — oo 


for  any  Oij  G  H,  i  =  1, . . . ,  r,  j  G  2Z. 

It  is  shown  in  [3]  that  {0i, . . .  is  stable  if  and  only  if  for  each  u  in 
H,  there  are  integers  fci , . . . ,  with 

det  ^ii{u  -f  27rfcj)j  7^  0.  (1) 

We  shall  say  a  matrix  M{z)  =  of  Laurent  polynomials  is 

invertible  if  it  has  an  inverse  which  is  a  matrix  of  Laurent  polynomials,  i.e. 

det  M{z)  =  az\  some  a  7^  0,  I  eTL. 

Lemma  1.  If  {<^1, . . . ,  <^r}  is  a  stable  basic  set  for  V,  then  {ipi,  in  V 

also  forms  a  stable  basic  set  for  V  if  and  only  if  s  =  r  and 

r  00 

*  =  E  E  i  =  (2) 

j—\  k——oo 


where  the  matrix  of  Laurent  polynomials 
Aiz)  := 


E  Aijik)z' 

Lfc  =  — OO 


is  invertible. 

Before  proving  this  lemma,  it  will  be  useful  to  introduce  the  following 
vector  notation.  For  a  basic  set  ,  <f>r},  we  let  cf)  denote  the  column 

vector  (^1, . . .  ,<l>r)^-  Then  we  can  write  (2)  as 

00 

ft=— 00 


(3) 
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where  Ak  is  the  matrix  Taking  Fourier  transforms  then  gives 


(4) 


where  z  =  e“*“. 

Proof  of  Lemma  1;  Suppose  that  (2)  holds,  where  A[z)  is  invertible.  Prom 
(4)  we  have 

^{u)  =  A{zY^i)(u). 

Since  A{z)~^  is  a  matrix  of  Laurent  polynomials,  it  follows  that  </>i, . . . , 
are  finite  linear  combinations  of  integer  translates  of  , . . . ,  ■  Since  V 

comprises  all  finite  linear  combinations  of  integer  translates  of  ...  ,07-,  it 
follows  that  {V’l, . . . ,  ^r}  is  a  basic  set  for  V. 

Also  for  any  u  in  R,  we  may  choose  integers  fci, . . . ,  fer-  so  that  (1)  holds, 
and  thus  from  (4), 

det[^i(u  4-  2'Kkj)Yi  j=i  =  det  A{z)  det[(^7(?z  +  ^  0. 


So  {-01, . . . ,  is  stable. 

Conversely  suppose  that  {'^i, .  ..,'ips}  is  a  stable  basic  set  for  V.  Then 
there  exist  an  s  x  r  matrix  A(z)  and  an  r  x  s  matrix  B(z)  of  Laurent  polyno¬ 
mials  such  that  for  z  =  e~*“. 


^(u)  =  A{z)$(u) 


and 

0(u)  =  B{z)'4’{u)  ~  B(z)A{z)${u). 

For  any  u  we  may  choose  integers  A^i, . . . ,  so  that  (1)  holds.  Since 

[$i  (u  +  2nkj )][  =  B{z)A{z)  (u  +  )]r  ,^1 , 

it  follows  that  B{z)A{z)  =  Ir^  the  r  x  r  identity  matrix.  Similarly  ^(u)  = 
A{z)B[z)'{l}[u)  and  since  {V>i, . . .  ,“^5}  is  stable,  we  can  deduce  as  above  that 
A{z)B{z)  =  Ig.  Thus  s  —  r  and  A{z)  is  invertible.  □ 

Theorem  2.  Suppose  that  {<^1, . . . ,  and  {V’l, . . . ,  V'r}  are  stable  basic  sets 
for  V  and  W  respectively.  For  k  G  Z,  i,j  =  1, . . .  ,r,  we  define 

/oo 

<f’iU--k),  (5) 

-OO 


and  let  M  denote  the  r  x  r  matrix  of  Laurent  polynomials  given  by 


M(z)  := 


(6) 
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Then  there  exist  biorthogonal  basic  sets  for  V  and  W  if  and  only  if  M  is 
invertible.  Moreover  in  this  case,  for  any  stable  basic  set  for  V  there  is  a 
unique  dual  stable  basic  set  for  W. 

Proof:  Let  {01,...,^^}  and  . . . , be  any  stable  basic  sets  for  V  and 
W  respectively.  Then  by  Lemma  2.1  we  have 

CXD  OO 

0=  ^  i>=  T.  (7) 

fczz-oo  fc  =  -00 

where  Ak,  Bk  are  r  x  r  matrices  such  that 

OO  OO 

A(z)  :=  Atz'’,  B(z)  :=  Y 

fc=-oo  k=-oo 

are  invertible. 

For  k  e7L,i,j  =  1, . . .  ,r,  define 

(9) 

J  —  OO 

and  let  M  denote  the  r  x  r  matrix  of  Laurent  polynomials  given  by 

r 

(10) 

Then  from  (2.5)-(2.10)  we  have  for  2:  =  e~^^, 

M{z)  =  A{z)M{z)B{zy,  (11) 

where  B{zy  =  B{z)  =  B{z~y)'^ . 

Now  by  (2.9)  and  (2.10),  0  and  ^  are  biorthogonal  if  and  only  if  M{z)  =  L 
If  this  holds,  then  by  (11),  M  must  be  invertible.  Conversely,  if  M  is  invertible, 
then  for  any  choice  of  ^4(2:)  we  can  define  B{z)  uniquely  by 

B{z)  =  {M(z)-^A{z)-^y 

so  that  (11)  holds  with  M{z)  =  /.  Thus  if  M  is  invertible,  then  for  any  stable 
basic  set  ^  for  V,  there  is  a  unique  dual  basic  set  ^  for  VF.  □ 

We  remark  that  from  the  definition,  any  biorthogonal  basic  sets  must 
have  linearly  independent  integer  translates.  Thus  if  M  as  in  Theorem  2.1  is 
invertible,  any  basic  set  for  F  or  W  which  is  stable  must  in  fact  have  linearly 
independent  integer  translates. 

Now  for  m  G  K,  m  >  2,  we  say  a  space  V  of  functions  on  IR  is  m-refinable 


M(z)  := 


Y 


Lfc  =  — OO 


if 
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If  is  a  basic  set  for  an  m-refinable  space  V,  then  is  in  V,  and  so 

CO 

JTl 

k——oo 

for  some  finite  set  of  matrices  Cfe.  Thus,  (j)  satisfies  the  refinement  equation 

c» 

4>=  -  k). 

k=—oo 

In  this  paper  we  shall,  for  simplicity,  consider  only  the  case  m  —  2. 

§3.  Biorthogonal  Refinable  Splines 

For  any  integer  n  >  1  we  let  Nn  denote  the  uniform  B-spline  of  degree  n  with 
simple  knots  at  0, 1, . . . ,  n  +  1.  We  now  fix  n,  and  define 

(a:)  ^  Wn(3x),  02 w  =  Nni^x  -  1),  03 (x)  =  iV„(3rc  -  2). 

Then  {^i,  02,  ^3}  is  a  stable  basic  set  for  the  space  V  of  all  spline  functions 
of  degree  n  with  compact  support  and  simple  knots  at  Clearly  V  is 

refinable.  We  wish  to  find  a  refinable  space  W  of  spline  functions  of  compact 
support  which  has  a  basic  set  which  is  dual  to  {0i,  02,  ^s}.  From  Theorem  2 
we  see  that  this  is  equivalent  to  finding  a  basic  set  {V'l, '02, for  IF  such 
that  the  matrix  M  in  (6)  is  invertible. 

We  shall  choose 

1pl(x)  =  Nn{2x),  02(ic)  =  Nn{2x  -  1), 

and  03  to  be  a  spline  function  of  degree  n  with  knots  in  and  support  in 
[| ,  2n  —  .  Thus  IF  is  a  space  of  spline  functions  of  degree  n  and  simple  knots 

in  is,  which  contains  all  spline  functions  of  degree  n  with  compact  support 
and  simple  knots  in  |2Z.  For  any  function  /  in  IF,  /(^)  has  knots  in  ^TZ,  and 
so  lies  in  W.  Thus  IF  is  refinable. 

Theorem  3.  We  can  choose  03  as  above  so  that  there  are  biorthogonal  basic 
sets  for  V  and  W,  or  equivalently  that  there  is  a  unique  basic  set  for  IF  dual 
to  the  basic  set  {0i,  02, 03}- 

Proof:  The  space  IFq  of  spline  functions  of  degree  n  with  simple  knots  in  |2Z 
and  support  in  [0, 2n]  has  dimension  3n,  while  the  space  IFi  of  spline  functions 
of  degree  n  with  simple  knots  in  |2Z  and  support  in  [0, 2n]  has  dimension  7n. 
Thus  we  may  choose  linearly  independent  functions  /i,...,/4n  in  IFi  with 
support  in  [^,2n  —  |]  which  together  with  Wq  span  Wi.  We  write 


4n 

03  =  ^Cikfk’ 

k=l 
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It  remains  to  choose  ai,...,a4n  so  that  M  in  (6)  is  invertible.  Now 
Mij{k)  =  0  except  in  the  following  cases;  Mii{k),  <  k  <  ^;  Mi2ik), 
<  fc  <  ^;  Mnik),  -2n  +  l  <  k  <  f,  M21W,  -%  <  k  <  a±i,- 

Mn{k),  <  f;  M22(k],  -2n  +  1  <  A:  <  2±i;  Mu{k),  - ^  <  *  < 

M32(k),  -f  <  fc  <  M33(k),  -2n  +  \<k<^.  Then 

n 

detM(2;)=  ^ 

fc=-3n+l 

for  some  numbers  6^,  — 3n  +  1  <  fc  <  n,  which  are  linear  functions  of 
ai,...,a4„.  The  condition  detM(2:)  =  1  then  gives  4n  linear  equations  in 
4n  unknowns  ai , . . . ,  a4n  and  we  shall  show  that  this  system  is  non-singular. 

Suppose,  to  the  contrary,  that  the  system  is  singular.  Then  we  may 
choose  Ui, . . .  ,a4„,  not  all  zero,  so  that  detM(2)  =  0.  Then  the  columns  of 
M  are  linearly  dependent  in  the  sense  that  there  are  Laurent  polynomials  pi , 
P2>  P3>  not  all  zero,  so  that 

3 

^  M{z)ijPj{z)  =0,  2  =  1,2, 3. 

i=i 

Writing 

oo 

P3{^)=  i  =  1,2,3, 

k=  —  oo 

this  becomes 

3  CX3  oo 

E  E  =  i  =  1.2.3, 

j  =  l  k=  —  oo  l—~(x> 


EE  Mij{l)cj{k  —  1)  =  0,  2  =  1,2,3,  fc  G  2Z, 

j  =  l  f=  — oo 

which  on  recalling  (5)  gives 


/: 


M-  “  fc)/  =  «  =  1i2,3, 


keTl, 


f  =  - 1). 

j  =  l  l=-oo 

Thus  /  is  orthogonal  to  V.  Note  that  since  the  integer  translates  of 
■01  >  02?  03  Q-re  linearly  independent,  /  is  not  identically  zero.  Since  only  a 
finite  number  of  coefficients  Cj(—l),  j  =  1,2,3,  I  €  ZZ,  are  non-zero,  /  has 
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compact  support.  Suppose  that  the  support  is  [«,/?  +  2n],  a,  6  ZZ,  but  not 
in  [a  +  1,  p  +  2n]  or  [a,  /?  +  2n  -  1].  It  is  easily  seen  that  a  <  /?  and 

j=a 

where  ^  ^  ^  and  g  has  support  in  [o,  /?  +  2n]  and  lies  in  the  space  of 
splines  of  degree  n  with  knots  in  ^ZZ,  which  we  shall  denote  by  Z. 

We  first  note  that  for  some  hi  in  Z  with  support  in  [/?  +  ^  ~  1,  +  2n], 


7]i  —  -l3)+hi 


is  orthogonal  to  those  elements  of  V  with  support  in  [/?  +  2n  -  1,  oo).  Now 


is  orthogonal  to  those  elements  of  V  with  support  in  [P  +  2n  —  2,  oo)  and  on 
this  interval  coincides  with  a  function 


m  :=  -P)  +  h2, 

where  ^2  is  in  Z  with  support  in  ^  —  2,  /?  +  2n].  Continuing  in  this  way 
we  recursively  construct 


Vj  ■=  -  /?)  +  hj,  i  =  1, . . .  ,4n, 

which  is  orthogonal  to  those  elements  of  V  with  support  in  [p  +  2n  —  j,  oo), 
where  hj  is  in  Z  with  support  in  [P  +  ^  —  j,P  2n].  In  particular,  r)4n  has 
support  in  [P  —  ^,P  +  2n]  and  is  orthogonal  to  those  elements  of  V  with 
support  in  [P  —  2n,  oo). 

Choose  F  with  =  r]4n  and  with  support  in  (-oo,/?  +  2n].  Now  for 

j  =  0, . . . ,  12n  —  1,  let  Bj  be  the  B-spline  of  degree  n  with  knots  p  —  2n-{- 
/3-2n+^,...,/?-2n+i±ftl.  Then  since  is  in  7, 

/oo  I>00 

-OO  J  —  oo 

=  [/3-2„+|,/J-2n  +  i±i,...,/J-2™  +  a|±i]K 

Thus  F  vanishes  at  P  —  2n  j  =  0, . . . ,  12n  —  1.  Now  F  coincides  on 
[P~2n,p-\-2n]  with  a  spline  G  of  degree  2n + 1  with  support  [/9 — 3n  —  ^ -f  2n] 
with  knots  at 

/5-3n-  /?,/?+  i,/3 -I-  i,...,/3  +  2n-  i,^  +  2n. 
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It  then  follows  from  the  Schoenberg- Whitney  Theorem  [5]  that  G  vanishes 
identically  on  [/?  —  2n,(3  +  2n]  and  hence  so  does  So  774^  has  support  in 
[/9  ”  —  2n]  with  knots  in  ^7L,  and  so  774^  vanishes  identically,  which  is 

a  contradiction.  Thus  the  linear  system  is  non-singular,  which  completes  the 
proof.  □ 

Finally  we  note  that  if  7/^3  is  as  in  Theorem  3  and  M  is  given  by  (6),  then 
the  basic  set  for  W  dual  to  {0i,<^2,03}  is  given  by 

00 

k=—oo 


where 


C,z>‘  =  {M{z)-^y  =  (adj  M{z)y  =  (adj  M{z-')f. 

k=-oo 
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Fitting  Parametric  Curves  to 
Dense  and  Noisy  Points 

A.  Ardeshir  Goshtasby 


Abstract.  Given  a  large  set  of  irregularly  spaced  points  in  the  plane,  an 
algorithm  for  partitioning  the  points  into  subsets  and  fitting  a  parametric 
curve  to  each  subset  is  described.  The  points  could  be  measurements  from 
a  physical  phenomenon,  and  the  objective  in  this  process  could  be  to  find 
patterns  among  the  points  and  describe  the  phenomenon  analytically.  The 
points  could  be  measurements  from  a  geometric  model,  and  the  objective 
could  be  to  reconstruct  the  model  by  a  combination  of  parametric  curves. 
The  algorithm  proposed  here  can  be  used  in  various  applications,  especially 
where  given  points  are  dense  and  noisy. 


§1.  Introduction 

In  many  science  and  engineering  problems  there  is  a  need  to  fit  a  curve  or 
curves  to  an  irregularly  spaced  set  of  points.  Curve  fitting  has  been  studied 
extensively  in  Approximation  Theory  and  Geometric  Modeling,  and  there  are 
numerous  books  on  the  subject  [1,5,6,12,23].  Existing  techniques  typically 
find  a  single  curve  segment  that  approximates  or  interpolates  the  given  points. 
Many  techniques  assume  that  the  points  are  ordered  and  fit  a  curve  to  them 
by  minimizing  an  error  criterion  [3,7,8,14,16,22,27,29,31,34].  If  the  points 
are  ordered,  piecewise  polynomial  curves  can  also  be  fitted  to  them  [19,30]. 
Difficulties  arise  when  the  points  are  not  ordered. 

To  fit  curves  to  an  irregularly  spaced  set  of  points,  1)  the  set  should  be 
partitioned  into  subsets,  2)  the  points  in  each  subset  should  be  ordered,  and 
3)  a  curve  should  be  fitted  to  points  in  each  subset.  This  paper  will  provide 
solutions  to  the  first  two  problems;  that  is,  partitioning  a  point  set  into  subsets 
and  ordering  the  points  in  each  subset.  Once  the  points  in  each  subset  are 
ordered,  existing  techniques  can  be  used  to  find  the  curves. 

Given  a  large  set  of  irregularly  spaced  points  in  the  plane,  {p^  =  {xi,yi)  : 
i  =  we  would  like  to  fit  one  or  more  parametric  curves  to  the 

points,  with  the  number  of  the  curves  to  depend  on  the  organization  of  the 
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points  and  the  resolution  of  the  representation.  When  fitting  a  parametric 
curve  to  an  irregularly  spaced  set  of  points,  the  main  problem  is  to  find  the 
nodes  of  the  curve.  The  nodes  of  a  parametric  curve  determine  the  adjacency 
relation  between  the  points  and  order  them.  The  curve  will  then  approximate 
the  points  in  the  order  specified.  Methods  to  order  sparse  points  [11,17,24] 
as  well  as  dense  points  [25,26,32]  have  been  developed.  Existing  methods, 
however,  fit  a  single  curve  segment  to  an  entire  data  set.  Sometimes  it  is  not 
desirable  to  fit  a  single  curve  segment  to  a  large  and  complex  point  set,  and 
it  is  necessary  to  represent  the  geometric  structure  present  in  the  point  set 
by  many  curve  segments.  In  this  paper  it  will  be  shown  how  to  partition  a 
point  set  into  subsets  and  how  to  fit  a  parametric  curve  to  each  subset.  A 
new  method  to  order  a  set  of  dense  and  noisy  points  for  curve  fitting  will  also 
be  presented. 

In  the  proposed  model,  a  radial  field  is  centered  at  each  point  such  that 
the  strength  of  the  field  monotonically  decreases  as  one  moves  away  from  the 
point.  The  sum  of  the  fields  has  the  averaging  effect  and  reduces  the  effect 
of  noise,  and  local  maxima  of  the  sum  of  the  fields  has  the  effect  of  tracing 
the  spine  of  the  points.  Therefore,  we  will  use  the  local  maxima  of  the  sum 
of  the  fields  (the  ridges  of  the  obtained  field  surface)  as  an  approximation  to 
the  curves  to  be  determined.  Based  on  the  organization  of  the  points,  disjoint 
ridges  may  be  obtained,  each  suggesting  a  curve.  The  ridges  will  be  used 
to  partition  the  points  into  subsets  and  fit  a  curve  to  each  subset.  In  the 
following,  the  steps  of  this  process  are  described  in  detail. 

§2.  Approach 

A  desirable  property  of  an  approximating  curve  is  for  it  to  pass  as  close  as 
possible  to  the  given  points  while  providing  a  certain  smoothness  appearance. 
For  a  dense  point  set,  the  curve  cannot  pass  close  to  all  the  points,  so  it  is 
desired  that  the  curve  trace  the  spine  of  the  points.  In  the  model  proposed 
here,  an  initial  estimation  to  a  curve  is  obtained  by  taking  points  in  the  xy 
plane  whose  sum  of  inverse  distances  to  the  given  points  is  locally  maximum. 
That  is,  if  the  sum  of  inverse  distances  of  point  (x,  y)  to  given  points  {(xj,  ?/i)  : 
i  =  is  larger  than  the  sum  of  inverse  distances  of  points  in  the 

neighborhood  of  (x,2/)  to  the  given  points,  then  point  (x,?/)  is  considered  an 
initial  estimation  to  a  point  on  the  curve.  Therefore,  by  tracing  points  in  the 
xy  plane  that  locally  maximize 

y)  =  Yl  +  (2/  -  2/0^  +  l]  ”  ^  (1) 

1=1 

we  find  an  approximation  to  the  curves  we  want  to  find. 

The  function  /  can  also  be  interpreted  cis  follows:  Suppose  a  radial  field 
of  strength  1  is  centered  at  point  (xi,  i/i),  i  =  1, . . . ,  A,  such  that  the  strength 
of  the  field  decreases  with  inverse  distance  as  one  moves  away  from  the  point. 
Then,  the  strength  of  the  field  at  point  (x,  y)  will  be  [(x  -  xj^  +  (y  -  y,)^]  “ , 
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and  the  curves  to  be  found  can  be  considered  points  in  the  xy  plane  whose 
sum  of  field  values  are  locally  maximum. 

Once  a  set  of  points  is  given,  the  function  f  becomes  fixed,  and  the 
obtained  ridges  will  have  a  fixed  shape.  In  order  to  have  control  over  the 
shape  or  smoothness  of  obtained  ridges,  we  revise  formula  (1)  as  follows.  If 

instead  of  inverse  distances  defined  by  [(x  —  +  (2/  —  2/i)^  +  l]  ^ ,  we  use 

[(x  -  XiY  +  {y-  yif  +  r^]~^  (2) 

in  equation  (1),  we  obtain 


y)  =  ^  [(a^  -Xif  +  {y-  Vi?  +  r^]  ^  (3) 

Z=1 

The  basis  functions  defined  by  (2)  are  known  as  inverse  multiquadrics  [13]. 
The  parameter  r  of  the  basis  functions  can  be  varied  to  generate  different 
surfaces  [21].  Figure  lb  shows  the  field  surface  obtained  when  using  the  points 
of  Fig.  la  and  inverse  multiquadric  basis  functions  with  r  =  5. 

Instead  of  inverse  multiquadric  basis  functions,  other  radial  basis  func¬ 
tions  [2,4,10,28,33,35]  also  can  be  used  to  define  function  g.  The  choice  of  the 
basis  functions  influences  the  shape  of  the  obtained  field  surface,  the  shape  of 
the  obtained  ridges,  and,  consequently,  the  shape  of  the  obtained  curves. 

By  tracing  the  local  maxima  of  the  field  surface  g  in  the  xy  plane,  we  will 
obtain  an  approximation  to  the  curves.  Parameter  r  changes  the  shape  of  the 
basis  functions  and  affects  the  shape  of  the  field  surface. 

Local  maxima  of  surface  g  can  result  in  structures  that  contain  branches 
and  loops.  The  proposed  model,  therefore,  can  recover  very  complex  patterns 
in  dense  and  noisy  point  sets.  Note  also  that  the  proposed  method  does 
not  require  any  knowledge  about  the  adjacency  relation  between  the  points. 
This  method,  in  fact,  provides  the  means  to  determine  the  adjacency  relation 
between  the  points. 


§3,  Implementation 

Derivation  of  an  analytic  formula  that  represents  the  local  maxima  of  the 
surface  g  may  not  be  possible.  Digital  approximation  to  the  local  maxima, 
however,  is  possible.  This  approximation  is  found  in  the  form  of  digital  con¬ 
tours  and  is  used  to  partition  the  points  into  subsets.  To  digitally  trace  surface 
ridges,  the  surface  is  digitized  into  a  digital  image.  The  digitization  process 
involves  starting  from  x  =  x^in  and  y  =  ymin  and  incrementing  x  and  y  by 
some  small  increment  S  until  reaching  x  =  Xmax  and  y  =  ymax-  For  each 
discrete  (x, y),  the  value  for  g{x^y)  is  then  found  from  formula  (3).  Xmin  and 
Xmax  could  be  the  smallest  and  largest  x  coordinates,  and  ymin  and  ymax  could 
be  the  smallest  and  largest  y  coordinates  of  the  given  points.  The  parameter 
6  is  used  as  the  increment  for  both  x  and  y  because  radially  symmetric  basis 
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functions  are  used  to  define  g.  This  parameter  determines  the  resolution  of 
the  obtained  image.  For  a  finer  resolution,  this  parameter  should  be  reduced, 
while  for  a  coarser  resolution  this  parameter  should  be  increased.  If  this  pa¬ 
rameter  is  to  be  chosen  automatically,  it  should  be  selected  such  that  most 
given  points  map  to  unique  pixels  in  the  obtained  image. 

Digitizing  the  surface  g  in  this  manner  will  result  in  a  digital  image  whose 
pixel  values  show  uniform  samples  from  surface  g.  Figure  lb  shows  digitization 
of  a  field  surface  into  an  image  of  256  x  256  pixels.  To  find  the  image  ridges, 
pixels  with  locally  maximum  intensities  are  located.  To  find  locally  maximum 
image  intensities,  the  gradient  magnitude  and  the  gradient  direction  [20]  of 
the  image  at  each  pixel  are  determined.  The  gradient  direction  at  a  pixel 
is  the  direction  at  which  change  in  intensity  at  the  pixel  is  maximum,  and 
gradient  magnitude  is  the  magnitude  of  the  intensity  change  in  the  gradient 
direction  at  the  pixel. 

To  find  the  ridges,  we  find  each  pixel  A  in  the  image  where  two  pixels  B 
and  C  that  are  adjacent  to  it  and  are  at  its  opposite  sides  have  intensities  that 
are  smaller  than  that  at  A.  Assuming  that  the  image  obtained  after  digitizing 
surface  g  is  represented  by  /,  we  mark  the  pixel  at  {i,j)  as  A  if  one  of  the 
following  is  true: 


&  i{i  +  i,j)<i(i,jy,  (4) 

I{i,j -1)  <  I{i,j)  &  I(i,j +  1)  <  (5) 

-1)  <  I{i,j)  k  /(i  +  1,  j  +  1)  < /(i.i);  (6) 

I(i-l,j  +  l)<I{iJ)  k  +  (7) 


Using  the  image  of  Fig.  lb,  we  find  that  pixels  in  the  contours  shown  in 
Fig.  Ic  are  marked  as  A.  We  will  call  the  contours  obtained  in  this  manner 
the  minor  ridges  of  the  image.  Next,  we  find  each  pixel  D  whose  value  is  not 
only  larger  than  those  of  B  and  C  adjacent  to  it  and  at  its  opposite  sides,  but 
which  also  has  a  gradient  direction  that  is  the  same  as  the  direction  obtained 
by  connecting  pixels  B  and  C.  The  gradient  direction  at  a  pixel  is  quantized 
with  45-degree  steps  to  ensure  that  only  directions  that  are  possible  to  obtain 
when  connecting  pixels  B  and  C  in  an  image  are  obtained.  The  pixels  marked 
as  D  are  shown  in  Fig.  Id.  We  will  call  these  contours  the  major  ridges  of  the 
image.  As  can  be  observed,  major  ridges  are  a  subset  of  minor  ridges.  We 
also  see  that  major  ridge  points  do  not  fall  on  small  and  noisy  branches  of 
the  minor  ridges  but  rather  fall  on  contours  that  represent  the  spines  of  the 
points.  If  the  minor  ridges  are  cut  at  the  branch  points,  and  branches  that 
do  not  contain  a  major  ridge  point  are  removed,  and  the  remaining  contours 
are  thinned,  we  obtain  Fig.  le.  The  obtained  contours  will  be  called  the 
local-maxima  contours,  or  simply  the  contours.  These  contours  will  be  taken 
as  approximations  to  the  curves  to  be  found.  We  will  use  them  not  only  to 
partition  the  points  into  subsets  but  also  to  order  the  points  in  the  subsets. 
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Fig.  1.  An  irregularly  spaced  set  of  points,  (bl  A  digitized  field  surface,  (c) 
Contours  representing  the  minor  ridges,  (d)  Contours  representing  the 
major  ridges,  (e)  Local-maxima  contours,  (f)  RaG  curves  with  a  =  0.04 
approximating  points  shown  in  (a). 


§4.  Node  Estimation 

The  method  outlined  in  the  preceding  section  determines  contours  that  are 
approximations  to  the  curves  to  be  found.  These  contours  will  be  used  to 
partition  a  point  set  into  subsets  and  order  the  points  in  each  subset. 

Suppose  a  point  set  has  produced  m  contours;  then,  a  point  is  assigned  to 
contour  j  {1  <  j  <m)  if  it  is  closest  to  a  pixel  in  contour  j  than  to  a  pixel  in 
any  other  contour.  In  this  manner,  a  point  is  assigned  to  one  of  m  contours. 
This  process,  when  completed,  will  partition  a  point  set  into  m  subsets  by 
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(a)  (b) 

Fig.  2.  (a),  (b)  Two  point  subsets  obtained  from  the  point  set  of  Fig.  la. 

assigning  the  points  into  one  of  m  contours.  Figures  2a  and  2b  show  the  point 
subsets  obtained  in  this  manner  from  the  point  set  of  Fig.  la. 

To  order  points  {qi  :  i  =  1, . . . ,  n}  in  subset  for  each  point  qj  a  point 
in  contour  j  that  is  closest  to  it  is  determined.  We  call  the  obtained  contour 
point  the  projection  of  point  qj.  After  determining  projections  of  all  points  in 
the  subset  to  the  contour,  the  contour  is  traced  from  one  end  to  the  other, 
and  in  the  order  the  projections  are  visited,  the  associated  points  are  ordered. 

Since  the  contours  are  approximations  to  the  curves  to  be  found,  the 
contour  length  from  a  projection  to  the  start  of  the  contour  is  divided  by  the 
length  of  the  contour  to  obtain  an  arc-length  estimation  to  the  node  of  the 
point.  If  the  contour  is  closed,  an  arbitrary  point  on  the  contour  is  taken  as 
the  start  point.  If  the  contour  is  open,  one  of  the  end  points  is  taken  as  the 
start  point. 

The  size  of  the  image  obtained  by  digitizing  surface  g  determines  the 
accuracy  of  the  obtained  nodes.  If  the  surface  g  is  very  coarsely  digitized,  the 
obtained  contours  will  be  very  short,  and  numerous  points  may  produce  the 
same  node,  especially  when  given  points  are  dense.  To  provide  a  more  accurate 
node  estimation,  the  surface  g  should  be  digitized  into  an  image  large  enough 
to  produce  unique  nodes. 

Once  the  coordinates  of  given  points  and  the  associated  nodes  are  known, 
a  parametric  curve  can  be  fitted  to  the  points  by  one  of  the  existing  methods 
[9,11,16,18,30].  Fitting  rational  Gaussian  (RaG)  curves  [9]  to  the  points  shown 
in  Fig.  la  with  nodes  as  determined  above,  we  obtain  the  curves  shown  in 
Fig.  If.  The  curves  are  overlaid  with  the  original  points  to  show  the  quality 
of  the  curve  fitting.  Note  that  these  curves  were  obtained  using  the  points  in 
Fig.  la  and  not  the  contour  points  in  Fig.  le.  The  contour  points  were  used 
only  to  partition  a  point  set  into  subsets  and  to  determine  the  nodes  of  the 
points. 


§5.  Observations 

To  observe  the  behavior  of  the  proposed  curve-fitting  method,  results  on  three 
additional  point  sets  are  shown  in  Fig.  3.  Figure  3a  shows  noisy  points  along 
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Fig.  3.  A  few  curve-fitting  examples. 


an  open  contour,  Fig,  3c  shows  a  dense  and  noisy  point  set  along  the  silhouette 
of  a  coffee  mug,  Fig.  3e  shows  irregularly  spaced  points  along  the  silhouette 
of  a  model  plane  and  one  of  its  wings.  We  can  see  the  geometric  structures 
in  these  point  sets  and,  if  asked,  can  trace  the  structures  manually  without 
any  difficulty.  The  algorithm  proposed  here  is  intended  to  do  the  same.  The 
curves  obtained  are  shown  in  Figs.  3b,  3d,  and  3f. 

The  point  sets  shown  in  Fig.  3  did  not  contain  geometric  structures  with 
branches  and  loops.  If  a  point  set  contains  branches  and  loops,  the  local- 
maxima  contours  will  also  contain  branches  and  loops.  A  single  curve  segment, 
however,  cannot  represent  branching  structures.  The  solution  we  propose  is 
to  segment  a  complex  contour  into  simple  ones  by  cutting  it  at  the  branch 
points  and  fitting  a  curve  to  each  branch. 
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§6.  Summary  and  Conclusions 

A  large  number  of  techniques  for  fitting  parametric  curves  to  irregularly  spaced 
points  have  been  developed.  These  techniques  fit  a  single  curve  to  the  given 
points  and  often  require  that  the  points  be  ordered.  In  science  and  engineering 
problems  that  deal  with  measurement  data,  the  given  points  may  not  be 
ordered  and  they  may  contain  noise.  Moreover,  it  may  not  be  appropriate 
to  fit  a  single  curve  segment  to  all  the  points.  In  this  paper,  a  method  to 
partition  a  point  set  into  subsets  and  fit  a  parametric  curve  to  each  subset 
was  described.  The  proposed  method  has  the  ability  to  take  into  consideration 
the  noisiness  and  denseness  of  a  point  set  when  obtaining  the  curves. 

Also  introduced  was  a  method  to  determine  the  nodes  of  a  parametric 
curve  that  approximates  a  set  of  dense  and  noisy  points.  The  proposed  method 
provides  the  means  to  fit  any  parametric  curve,  including  B-Splines  and  Non- 
Uniform  Rational  B-Splines,  to  irregularly  spaced  points.  Although  in  this 
paper  only  inverse  multiquadrics  were  used  as  basis  functions  to  obtain  a  field 
surface,  from  which  the  curve  segments  were  determined,  other  radial  basis 
functions  [33]  can  be  used  in  the  same  manner.  Depending  on  the  parametric 
curve  formulation  and  the  radial  basis  functions  used,  the  number  and  the 
shapes  of  the  curves  fitting  to  a  set  of  points  may  vary. 
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and  Riadh  Taleb 


Abstract.  The  construction  of  a  smooth  surface  from  an  irregular  mesh 
in  space  is  considered.  The  mesh  vertices  can  either  be  interpolated  or 
approximated  as  a  control  net.  A  collection  of  triangular  Bezier  patches 
results  from  a  local,  affine  invariant  and  visually  smooth  interpolation 
scheme  that  can  represent  surfaces  of  arbitrary  topological  type.  It  is 
based  on  a  domain  4-split.  Beside  the  surface  construction  scheme,  the 
optimal  employment  of  the  numerous  degrees  of  freedom  is  crucial  for  an 
overall  pleasing  shape.  Different  local  minimum  norm  criteria  are  tested 
to  see  if  they  produce  satisfactory  shapes. 


§1.  Introduction 

The  general  problem  of  constructing  a  parametric  triangular  continuous 
surface  interpolating  an  irregular  mesh  in  space  has  been  considered  by  many 
authors.  In  [5]  a  survey  of  such  schemes  is  given,  and  it  is  concluded  that 
local  polynomial  interpolants  have  similar  shape  defects  due  to  the  absence  of 
an  optimization  strategy  for  using  the  free  parameters  (a  special  solution  has 
been  proposed  in  [6]  for  one  of  these  schemes). 

A  different  method  has  been  developed  by  Loop  [4]  producing  a  collec¬ 
tion  of  patches  that  meet  each  other  with  continuity.  The  vertex  enclo¬ 
sure  problem,  which  occurs  when  joining  with  G^  continuity  an  even  number 
of  polynomial  patches  around  a  vertex,  is  solved  by  first  constructing  C^- 
consistent  boundary  curves  and  cross-boundary  tangents  and  then  filling  in 
the  patches.  In  one-to-one  correspondence  to  the  mesh  faces,  sextic  triangular 
Bezier  patches  are  constructed,  which  lead  to  a  very  small  number  of  degrees 
of  freedom.  One  scalar  value  per  vertex  controls  the  length  of  the  tangents 
of  the  boundary  curves  at  the  end  points  and  one  control  point  per  patch  is 
free.  This  is  not  enough  for  sufficient  control  of  the  shape  of  a  sextic  patch.  In 
[5],  it  was  stated  that  well-shaped  boundary  curves  are  a  necessary  condition. 
Loop’s  scheme  doesn’t  provide  any  influence  on  the  second  derivatives  of  the 
boundary  curves,  which  can  lead  to  undulations.  It  was  therefore  proposed  to 
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relax  the  interpolation  condition,  which  leads  to  an  extra  free  parameter  per 
vertex  controlling  the  distance  from  the  patch  vertices  to  their  corresponding 
mesh  vertices.  This  is  clearly  improving  the  shape. 

Recently,  another  triangular  interpolation  scheme  has  been  developed 
[3].  A  regular  domain  4-split  leads  to  the  construction  of  four  quintic  Bezier 
patches  which  form  a  macro-patch  in  one-to-one  correspondence  to  a  mesh  face. 
They  have  one  polynomial  degree  less  than  Loop’s  scheme,  but  one  degree 
more  than  Piper’s  or  Shirman-Sequin’s  method  [8,9].  The  domain  4-split  is 
a  new  approach  in  triangular  mesh  interpolation,  and  has  several  obvious 
advantages:  the  boundary  curves  and  cross-boundary  tangents  are  piecewise 
polynomial.  They  can  therefore  be  of  low  degree  and  simultaneously  separate 
first  and  second  derivatives  of  the  boundary  curves  of  the  macro-patch  corners 
from  the  neighbours.  The  scheme  is  completely  local.  Furthermore,  the  4-split 
leads  to  four  patches  per  macro-patch  which  leaves  enough  control  points  free 
for  inner  shape  control.  Finally  this  scheme  offers  two  parameters  per  vertex 
for  controlling  first  and  second  derivatives  of  the  boundary  curves,  and  six 
free  control  points  inside  the  macro-patch.  Additionally,  the  interpolation 
condition  can  also  be  relaxed  to  gain  one  more  free  parameter  per  vertex. 

The  present  paper  investigates  the  problem  of  how  the  free  parameters 
and  control  points  of  the  4-split  domain  method  can  be  employed  optimally. 
The  challenge  is  to  get  an  overall  satisfactory  shape,  which  is  a  global  re¬ 
quirement,  while  maintaining  the  locality  property  of  the  scheme.  Various 
geometric  and  variational  criteria  are  proposed  and  compared. 


§2.  Triangular  Interpolation  by  4-splitting  Domain  Triangles 
2.1  Notations 

The  surface  mesh  M  is  input,  and  consists  of  a  list  of  vertices  and  edges  de¬ 
scribing  a  2-manifold  triangulated  mesh  in  The  surface  S  which  interpo¬ 
lates  the  vertices  of  A4  is  composed  of  triangular  macro- patches  M'  which  are 
in  one-to-one  correspondence  with  the  mesh  facets.  It  is  therefore  convenient 
for  the  construction  of  S  to  choose  a  parameterization  of  the  macro-patches 
M*  around  a  common  vertex,  sharing  pairwise  a  common  boundary  as  illus¬ 
trated  in  Fig.  1.  All  subscripts  i  =  1, . . . ,  n  are  taken  modulo  n,  where  n  is 
the  order  of  the  mesh  vertex  corresponding  to  M‘  (0,0).  The  parameter  Ui  lies 
in  the  interval  [0, 1]. 

The  fundamental  idea  of  the  present  triangular  interpolation  scheme  is  to 
subdivide  the  domain  triangle  into  four  subtriangles  by  joining  the  edge  mid¬ 
points  together,  see  Fig.  1.  Each  macro-patch  will  therefore  be  a  piecewise 
polynomial  image  of  the  unit  triangle  in  composed  of  four  quintic  Bezier 
triangles  [2]  each.  The  macro-patches  will  join  together  with  continuity. 
The  resulting  surface  S  will  also  be 

The  G^  conditions  which  are  used  in  this  paper  are  subject  to  some  simpli¬ 
fying  assumption  in  order  to  keep  the  interpolation  scheme  of  low  degree.  Two 
adjacent  patches  Wi)  and  M*(u,  ,'Ui+i)  join  at  a  common  boundary 
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Fig.  1.  Parameterization  and  domain  4-split, 
with  continuity  if  there  exists  a  scalar  function  such  that 

Mi(^i,0)  =  i  1  ,  (1) 

where  n  is  the  order  of  the  vertex  corresponding  to  Ui  =0.  MJ.  denotes  the 
partial  derivative  of  with  respect  to  ui. 

The  algorithm  for  constructing  the  spline  surface  consists  of  three  steps 

•  constructing  boundary  curves, 

•  constructing  cross-boundary  tangents, 

•  filling  in  the  patches, 

which  will  be  briefly  presented  in  the  following  three  subsections.  For  more 
details  and  complete  explanations  of  this  method,  the  reader  is  referred  to  [3]. 

It  is  important  to  keep  all  these  functions  of  the  lowest  degree  possible. 
The  main  contribution  to  this  comes  from  the  domain  4-split.  It  allows  for 
piecewise  polynomial  functions  of  low  degree  while  simultaneously  fulfilling  all 
other  requirements,  such  as  continuity  and  localness. 

2.2.  Boundary  curves  and  vertex  consistency 

First  the  boundary  curves  of  the  macro-patches  are  constructed  in  correspon¬ 
dence  to  the  edges  of  M  by  interpolating  the  mesh  vertices  at  the  end  points 
by  satisfying  the  conditions  at  the  vertices  and  by  keeping  the  surface 
scheme  local.  They  are  called  C^-consistent. 

Each  boundary  curve  between  two  adjacent  patches  is  a  piecewise  (2 
pieces)  cubic  Bezier  curve  parameterized  on  {0,  1}.  Around  each  vertex  of 

M,  the  control  points  6o,6i,62>  I  =  1, . .  •  of  all  incident  boundary  curves 
are  constructed  independently  from  the  joining  curve  piece  of  the  opposite  ver¬ 
tices.  The  “midpoints”  63  are  then  constructed  in  order  to  have  boundary 
curves.  See  Fig.  2  for  the  notation. 

At  a  vertex  v  the  ^j-functions  which  are  defined  on  the  incident  edges  to 
V  are  first  determined  by  calculating  $i(0)  and  $i(l)  from  system  (1)  by 
solving  it  for  Ui  =  0  and  Ui  =  1  resp.,  which  gives  =  $i{0)  =  cos(^) 
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Fig.  2.  Control  points  of  the  boundary  curves  at  vertex  V. 


and  $i(l)  =  1  —  cos(^).  The  domain  4-split  now  enables  to  seperate  vertex 
derivatives,  and  to  take  the  $i-function  to  be  piecewise  linear: 

{cos  ^(1  -  2ui)  +  Ui,  for  Ui  G  [0,  |], 

(2) 

(1  -  tti)  +  (1  ~  cos  ^){2ui  -  1),  for  Ui  G  [^,  1]. 


Let  us  now  adopt  a  matrix  notation  for  the  boundary  curve  control  points 
between  v  and  pj,  i  =  1, . . . ,  n: 


r^n 

'bV 

rp.i 

~v' 

6o  := 

1 

_ Co 

.o 

,  6i:= 

I - 

^  ... 

1 _ 

,  ^2  •  = 

b2. 

,  p  := 

'ts  ... 

3 

1 _ 

,  V  := 

where  p  is  referred  to  as  the  vertex  neighborhood  of  v. 

The  following  choice  for  the  boundary  curve  Bezier  points  near  the  vertex 
V  enables  us  to  find  a  solution  to  system  (1)  which  at  the  same  time  solves 
the  vertex  consistency  problem  [3,4]: 


bo  =  av-\-  B^p , 

bi  =  av  +  B^p,  (3) 

h  =  [(7o  +  7i)q;+  + 


where  are  n  x  n  matrices  defined  as 


n 

1  1  —  a  +  /?  cos(--^~-^) 

^  n  ’ 

,2  (70  +  7i)(l  -  q)  +  7i/^cos(-^^~-^) 

a  = - 1 - +  72 


1/6  if  j  =  i  —  1,  z  +  1, 
1/3  if  j  =  z, 

0  otherwise. 


(4) 
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The  free  parameters  a, /3, 71, 72  control  the  interpolation  and  the  first  and 
second  derivatives.  In  Section  3  it  will  be  shown  how  they  can  be  set  optimally. 
The  control  points  of  the  joining  curves  pieces  bj,  6^,  62  ^3  “  ^3  found 

by  applying  the  formulas  (3)  and  (4)  to  the  neighbouring  mesh  points  of  v. 
k  is  the  index  of  v  relative  to  the  neighborhood  of  v. 

This  curve  network  construction  is  local  in  that  changes  of  one  mesh 
vertex  only  affect  the  boundary  curve  pieces  relative  to  the  neighbourhood  of 
that  vertex. 

2.3.  Cross-boundary  tangents 

The  cross-boundary  tangents  are  subject  to  the  conditions  (1),  the  vertex 
consistency  constraints,  and  the  curve  network  of  Sect.  2.2,  and  are  set  to  be 
equal 

1  (5) 

The  scalar  function  and  the  vector  function  Vi  are  built  of  minimal  degree 
so  as  to  interpolate  the  values  of  the  cross- derivatives  and  the  twists  at  the 
vertices  p  and  p^ : 


T  /  \  .  27r,^  ,  ,  27r 

=  sm  — (1  —  Ui)  -f  sm  — Ui^ 
n  Tij 

Vii-Ui)  =  ^  v^Bl{2ui)  Ui  e  [0,  i] 
*  =  1 


(linear) 

(piecewise  quadratic) 


where 


Vo  =  V^p,  vi  =  V^p,  V2  =  ~vi  -f 


The  nx  n  matrices  and  are  given  by 

^^0  6^  .  ^27r(j-^)^  .  .  , 

Vij  =  —  sm( - ),  i,j  =  l,...,n, 

n  '  r)  ' 


(6) 

(7) 


^  -  48'/’“  +  24/)  tan(^ )  -  6/]  ^ 


_L  ^  if  i  =  z  +  1, 

^  \-l  if  i  =  z-l, 

where  =  $i(0),$^  =  $5(0)  and  =  $5(0)  are  known  from  (2)  and  (6). 


2.4.  Filling-in  the  macro-patches 

Each  macro-patch  is  composed  of  four  quintic  triangular  Bezier  patches. 
The  boundary  curves  of  the  macro-patch  are  the  twice  degree  elevated  curves 
of  Section  2.2.  The  cross-boundary  tangents  of  Sect.  2.3  determine  the  first 
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Fig  3.  C^-conditions  between  two 

adjacent  quintic  Bezier  patches. 


Fig  4.  Labelling  the  control  points  of 
a  quintic  Bezier  patch. 


Fig.  5.  Four  steps  for  filling  in  the  macro-patch  M  with  C^-continuity. 


inner  row  of  control  points  after  one  degree  elevation  [3].  The  remaining 
15  inner  control  points,  which  are  highlighted  in  Fig.  5a,  are  now  computed 
by  joining  the  four  inner  patches  with  continuity.  The  necessary  and 
sufficient  C ^-continuity  conditions  between  two  internal  Bezier  patches  inside 
one  macro-patch  are  shown  in  Fig.  3:  all  pairs  of  adjacent  triangles  must  form 
a  parallelogram.  In  [3]  it  was  shown  that  the  first  and  last  pairs  of  adjacent 
triangles  in  Fig.  3  already  form  parallelograms. 

It  remains  to  compute  the  free  Bezier  points  such  that  the  other  three  pairs 
of  triangles  along  each  edge  inside  the  macro-patch  also  form  parallelograms. 
This  is  done  in  four  steps: 

•  choose  the  three  twists  points  of  the  internal  Bezier  patch  arbitrarily; 
these  are  free  shape  parameters  (see  Fig.  5a), 

•  compute  the  third  and  fourth  Bezier  points  along  each  internal  curve 
joining  two  Bezier  patches  using  the  second  and  fourth  parallelogram 
conditions  (see  Fig.  5b), 

•  choose  the  remaining  three  unknown  Bezier  points  of  the  central  patch 
arbitrarily;  these  are  free  shape  parameters  (see  Fig.  5c), 

•  compute  the  three  remaining  unknown  Bezier  points  of  the  outer  patches 
using  the  third  parallelogram  condition  along  each  edge  (see  Fig.  5d). 

§3.  Local  Optimization  of  the  Boundary  Curves 

The  present  triangular  interpolation  method  offers  several  degrees  of  freedom 
for  shape  control.  They  can  be  set  manually  or  by  using  simple  heuristics.  An 
interactive  design  system  can  allow  for  manually  adjusting  these  parameters 
in  order  to  improve  the  shape.  This  procedure  seems  not  to  be  sufficient  if  the 
given  triangulated  point  set  is  very  large,  or  if  the  data  points  are  irregularly 
distributed. 

Our  goal  is  to  investigate  some  optimization  techniques.  Two  groups  of 
degrees  of  freedom  have  to  be  distinguished.  First  there  are  4  scalar  parame- 
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ters  per  vertex  controlling  the  curve  network,  then  6  free  inner  Bezier  control 
points  are  available  for  each  macro-patch.  Let  us  first  concentrate  on  the  curve 
network.  It  was  stated  in  [5]  that  triangular  interpolants  often  suffer  from  un¬ 
dulating  curve  networks.  It  can  be  confirmed  here  that  a  “well  shaped”  curve 
network  is  not  sufficient,  but  is  necessary  for  the  construction  of  a  pleasing 
shape.  As  pointed  out  in  Sect.  2,  the  4-split  method  is  local.  This  property 
should  not  be  altered  by  an  optimization  procedure.  Local  optimization  cri¬ 
teria  are  therefore  needed.  This  localness  requirement  conflicts  in  some  sense 
with  the  global  requirement  of  a  “well  shaped”  surface.  Every  local  scheme  has 
to  accept  this  conflict,  otherwise  it  loses  its  localness  property.  Nevertheless, 
it  will  be  shown  that  good  results  can  be  obtained. 

In  detail,  four  curve  parameters  per  vertex  are  available,  see  (3)  and 
(4).  a  is  not  really  free.  It  allows  us  to  switch  between  interpolation  and 
approximation  of  the  surface  mesh.  At  first,  we  only  consider  the  interpolation 
problem  and  set  a  =  1.0.  /?  affects  the  length  of  the  tangent  vectors  of  the 
boundary  curves  at  the  vertex  v  =  bQ.  The  control  points  bi  are  obtained  by 
a  first  order  Fourier  approximation  of  the  neighbourhood  ~  1, . . . ,  n  of  v. 
In  other  words,  the  b\  are  an  affine  image  of  a  regular  n-gon  whose  centroid  is 
bo-  Too  short  tangent  vectors  lead  to  sharp  corners  at  the  patch  vertices,  while 
too  long  tangent  vectors  can  lead  to  unwanted  undulations.  71,72  control  the 
second  derivatives  of  the  vertices.  The  control  points  62  depend  linearly  on 
them.  But  they  don’t  depend  linearly  on  /?,  71,72. 

Due  to  the  previous  observations,  the  optimization  of  ^^,71,72  should 
mainly  avoid  undulations  and  allow  for  more  or  less  bent  or  stretched  curves. 
If  for  computation-time  reasons  one  wants  to  perform  only  linear  optimization 
in  a  least-squares  sense,  as  we  do,  it  should  be  done  in  two  steps  separating  (3 
from  71,72. 

In  the  following,  the  computation  of  optimal  /?,  71,72  is  separated  into 
two  steps.  Otherwise,  the  problem  would  become  non-linear.  Each  boundary 
curve  consists  of  two  cubic  pieces  joining  with  continuity.  For  locality 
reasons  of  the  whole  scheme,  the  pieces  have  to  be  constructed  independently 
one  another.  In  a  first  step  the  points  b\  of  the  boundary  curves  incident  in 
V,  Fig.  2,  are  determined.  Each  boundary  curve  corresponds  to  an  edge  of 
the  surface  mesh.  In  this  case  we  are  looking  at  the  edge  connecting  v  and 
Pi.  In  order  to  reproduce  the  shape  inherent  to  the  underlying  surface  mesh, 
geometrical  considerations  imply  that  b[  would  optimally  lie  in  the  plane 
spanned  by  this  edge  and  a  vector  between  vq  and  the  orthogonal  projection 
of  Pi  on  the  tangent  plane  in  bQ,  as  Piper  does  in  [8].  We  call  these  points 
S*  =  [b\  The  tangent  planes  should  be  estimated  first.  The 

constraints  on  the  boundary  curves  in  the  present  method  don’t  allow  for 
setting  61  equal  b^,  this  is  why  these  points  are  approximated  in  a  least- 
squares  sense.  The  key  point  here  is  that  the  locality  of  the  equations  (3) 
and  (7)  enables  us  to  replace  the  true  neighbourhood  points  Pi  of  v  in  these 
equations  by  new  “virtual”  neighbourhood  points  p*  that  are  only  used  in 
these  equations,  i.e.  to  compute  the  boundary  curves,  and  the  cross-boundary 
tangents.  Therefore,  we  are  able  to  solve  the  following  linear  least-square 
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problem  in  order  to  compute  the  “virtual”  points  p* : 

(9) 

t  =  l 

The  new,  optimal,  control  points  are  now  given  by 

=  +  (10) 

In  a  second  step,  62  has  to  be  determined,  p  :=  p*  and  /3  are  already  fixed. 
Two  parameters  per  vertex,  71,72  are  left  free  for  optimization.  63  is  then 
fixed  as  the  midpoint  between  62  f^e  two  curve  pieces.  The  requirements 
on  62  are  twofold:  avoid  undulations  and  bend  the  curve  on  request.  The 
second  requirement  concerns  the  whole  boundary  curve  between  v  and  p^, 
and  depends  on  the  choice  of  62-  The  idea  is  to  cope  with  that  problem  by 
introducing  a  target  point  t  for  each  boundary  curve.  The  control  points  62  of 

each  curve  piece  are  then  determined  so  that  63  =  1(^2  +  ^2)  approaches  the 
target  point  by  minimizing  an  appropriate  energy  functional  on  each  curve 
piece  locally.  The  introduction  of  the  target  point  allows  for  global  control  of 
the  boundary  curves,  while  still  keeping  the  scheme  local. 

The  target  point  is  fixed  by  a  subdivision  rule  in  terms  of  60,  b[  and  Bq, 
of  the  joining  curve  piece,  such  as 

t  =  l(-6o  +  96\  +  9bi  -  So).  (11) 

Id 

62,  which  depends  linearly  on  the  free  parameters  71,72,  is  now  determined 
by  minimizing  the  linearized  version  of  the  bending  energy  combined  with  a 
curve  length  component  [1] 

f' \\X"{t)fdt  +  u;  f' \\X'(t)fdt,  u;>0.  (12) 

Jo  Jo 

The  solution  of  a  linear  2x2  system  gives  the  optimal  values  for  71, 72  for  62- 
Different  ways  for  finding  an  optimal,  i.e.  well  shaped,  curve  network 
have  been  studied.  Within  the  local  schemes,  the  concept  of  target  points 
can  be  replaced  by  target  tangent  vectors.  This  leads  to  a  2n  x  2n  linear 
system  of  equations  per  vertex.  /?,  71 , 72  can  also  be  determined  by  a  non¬ 
linear  optimization  method  in  only  one  step.  When  relaxing  the  localness 
requirement  of  the  scheme,  plenty  of  curve  network  schemes  are  possible,  like 
a  variant  of  Nielson’s  MNN  [7]  or  the  integration  of  given  curvature  values  or 
second  fundamental  forms  at  the  mesh  vertices  in  the  optimization  process. 
This  is  not  a  subject  of  the  present  paper. 

§4.  Minimum  Norm  Criteria  for  Macro-patches 

Once  the  curve  network  and  the  cross-boundary  tangents  are  constructed,  15 
inner  Bezier  control  points  remain  for  each  macro-patch.  They  are  related  to 
each  other  by  the  continuity  conditions  which  are  imposed  between  the 
four  quintic  sub-patches.  Six  of  them  are  completely  free.  They  are  drawn  as 
full  black  dots  in  Figs.  5a  and  5b.  The  remaining  9  points  depend  linearly  on 
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Fig.  6.  Left:  manually  fixed  free  parameters,  Middle:  optimized  boundary  curves, 
Right:  optimized  boundary  curves  and  patches. 


them.  It  is  therefore  possible  to  use  one  of  the  two  quadratic  functionals 
Ek{X,  X),  where  A;  =  2, 3  and 


Ek{X. 


d^x  ^  ^ 


dudvdw. 


du^  du^  dv^  dv^  dw^  dw^  J 

The  free  points  s  are  determined  by  minimizing 
3  3 

'^Et{S’,ff)  =  '^{MjS  +  ajfA{M^s  +  aj),  fc  =  2,3,  (13) 

j=0  j=0 


where  =  E|i|=5  w)  denote  the  four  quintic  Bezier  sub-patches, 

and  s  =  [sii3,  Sisi,  S311,  <8122?  «212j  ^221]*  denotes  the  vector  of  the  6  unknown 
control  points  of  the  middle  patch  (see  Figs.  4  and  3).  The  (21  x  6)  matrices 
Mj  and  the  vectors  aj  contain  the  linear  relations  between  control  points  of 
the  4  sub-patches  and  the  6  unknown  points.  The  (21  x  21)  matrix  A  is  given 
by  =  Ek{Bi,  Bj)  for  |i|  =  5,  |i|  =  5. 


§5.  Results 

Fig.  6  shows  the  interpolation  of  a  tetrahedron  by  our  method.  This  very 
simple  example  was  chosen  because  it  illustrates  clearly  the  influence  of  the 
free  parameters  and  control  points.  The  upper  row  shows  three  surfaces  with 
the  boundary  curves  of  the  macro-patches,  while  the  lower  row  shows  their 
iso-parametric  lines.  The  left  surface  is  obtained  by  manually  setting  a  =  1.0, 
/3  =  0.15,  7o  =  —1.0,  71  =  2.0,  72  =  0.0.  These  values  are  identical  for  all 
vertices  due  to  the  regularity  of  the  surface  mesh.  The  free  inner  control  points 
are  set  by  a  rule  combining  the  mesh  face  normal,  cross-boundary  tangents 
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and  the  boundary  curves.  The  middle  surface  has  optimized  boundary  curves, 
(9)- (12).  Optimal  “virtual”  neighbour  points  are  calculated,  and  the  optimal 
parameters  are  70  =  —1.598,  71  =  2.393,  72  =  0.205.  Face  energy  without 
minimization  is  equal  to  732.2.  The  right  surface  has  the  same  boundary 
curves  as  the  previous  example,  but  the  free  inner  control  points  of  the  macro¬ 
patches  are  obtained  by  minimizing  the  face  energy  (13)  with  k  =  3.  The 
energy  decreases  to  309.0.  The  connections  between  the  macro-patches  are 
sharper  for  the  manual  setting.  An  overall  more  smooth  surface  results  from 
the  optimized  parameter  setting  of  this  paper.  The  distribution  of  the  iso¬ 
parametric  lines  shows  a  positive  side-effect:  it  is  more  regular  at  the  patch 
vertices  for  the  optimized  surfaces. 
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Multi-Level  Approximation  to  Scattered 
Data  Using  Inverse  Multiquadrics 

S.  J.  Hales  and  J.  Levesley 


Abstract.  A  method  of  finding  local  approximations  is  used  to  thin 
data  before  a  hierarchical  iterative  refinement  scheme  is  employed  in  con¬ 
junction  with  domain  decomposition.  The  interpolation  problem  on  each 
sub-domain  is  solved  by  using  the  same  stored  inverse.  The  approxima¬ 
tion  power  of  the  inverse  multiquadric  is  exploited  whilst  overcoining  the 
computational  difficulties  associated  with  globally  supported  basis  func¬ 
tions. 


§1.  Introduction 

Radial  basis  functions  have  been  widely  used  for  multivariate  interpolation  of 
scattered  data,  see  [4]  for  a  summary.  An  interpolant  is  generated  by  a  linear 
combination  of  basis  functions  (j)  at  distinct  centres  Xj,  z  =  1, . . . ,  iV; 

N 

i~l 

constrained  by  5(xi)  =  /j  ,  i  =  1, . . . ,  iV,  where  T  :  i— ^  R  and  fi  = 

The  interpolation  matrix  A  E  R^  x  R^  is  given  by  = 

(i>{\\xi  —  Xjll),  and  A  satisfies 

^A  =  /,  (2) 

where  A  =  [Ai  •  •  •  A/v]^  and  /  =  [/i  •  " 

Common  choices  for  (f)  in  this  setting  are  given  in  [6], 

(l){\\x  -  arill)  =  exp(-c2||a;  -  Gaussian, 

0(lja:  -  Xfjj)  =  (c^  +  \\x  -  Inverse  multiquadric, 

where  c  is  a  constant  shape  parameter.  With  a  small  modification  to  the 
scheme,  the  thin  plate  spline  and  multiquadric  are  also  used. 
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The  above  parameter-dependent  functions  are  good  at  approximating 
data  for  certain  values  of  c,  but  these  cause  inherent  ill-conditioning  in  A. 
Schaback  [7]  explains  this  phenomenon  by  means  of  an  “Uncertainty  Relation” 
between  upper  bounds  on  errors  for  interpolants  of  the  form  (1),  and  lower 
bounds  on  the  smallest  eigenvalue  of  A.  Iterative  techniques  for  solving  such 
badly  conditioned  systems  often  suffer  from  poor  rates  of  convergence,  and 
therefore  computationally  expensive  direct  methods  have  to  be  employed. 

In  the  inverse  multiquadric  case,  large  values  of  c  achieve  good  initial  ap¬ 
proximations  to  smooth  data,  whilst  smaller  values  produce  functions  capable 
of  resolving  fine  detail.  Ideally,  such  properties  could  be  exploited  without 
having  to  solve  (2)  directly. 

Since  s  is  evaluated  at  points  y  ^  Xi  where  an  approximation  is  required, 
a  global  solution  incorporating  all  N  centres  may  be  inappropriate.  Further, 
it  is  unnecessary  to  find  A  such  that  ||/  -  i4A||oo  <  \^(y)  -  5(2/)|,  since  the 
accuracy  of  s[y)  is  limited  by  the  approximating  power  of  (j).  Rather  than 
searching  for  a  complete  global  solution,  this  suggests  that  attention  may  be 
focussed  on  small  regions  around  evaluation  points.  Moreover,  the  aim  is 
to  obtain  a  solution  such  that  the  residual  and  approximation  accuracy  are 
comparable,  for  little  is  to  be  gained  by  having  a  small  residual,  while  the 
approximation  power  of  the  basis  functions  limits  the  final  accuracy. 

In  Section  2,  local  approximations  are  used  to  convert  irregular  data  to 
a  regular  mesh  of  approximate  function  values.  Whilst  the  method  can  be 
generalised  to  ]R‘^,  the  description  and  examples  are  given  in  The  system 
of  equations  associated  with  the  gridded  data  is  inverted  and  used  to  solve 
subsequent  systems. 

Floater  &:  Iske  [1]  demonstrate  the  benefits  of  a  multi-level  approach 
to  approximation,  and  the  theoretical  foundation  is  provided  by  Narcowich, 
Schaback  &  Ward  [5].  Section  3  describes  the  present  hierarchical  iterative 
refinement  algorithm,  and  explains  the  computational  advantages  of  domain 
decomposition  and  the  use  of  a  stored  inverse. 

§2.  Local  Solutions  and  Gridding  Data 

If  the  function  T  is  not  arbitrary,  but  arises  from  a  physical  system,  then  some 
degree  of  smoothness  can  be  assumed.  A  smooth  data  set  can  be  significantly 
thinned  whilst  retaining  general  information  about  its  behaviour.  Floater  & 
Iske  [1]  demonstrate  that  Delaunay  triangulation  can  be  used  to  optimise  the 
uniformity  of  data,  and  provide  a  good  thinning  algorithm.  Such  triangulation 
and  assembling  of  data  is  computationally  expensive  for  excessively  large  N . 
An  0{N)  method  of  finding  uniform  approximate  data  is  presented. 

An  approximation  to  .7^  at  a  point  y  €  is  achieved  by  solving  a  small 
interpolation  problem  centred  on  y.  The  closest  q  points  in  X  to  ?/  are  inter¬ 
polated  by  inverse  multiquadrics  with  shape  parameter  ciocah  evaluated 
at  y.  Since  q  can  be  as  low  as  20  ~  30,  Ciocai  can  be  relatively  large  be¬ 
fore  the  matrix  ill-conditioning  becomes  unacceptable,  thus  yielding  a  good 
approximation. 
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This  method  is  highly  parallelizable,  and  large  data-sets  can  be  dealt 
with  without  the  need  of  assembling  or  storing  the  matrix  A. 

Finding  the  optimum  shape  parameter  on  regular  or  scattered  data  re¬ 
mains  an  open  problem,  as  shown  in  [3].  There  is  no  obvious  correlation 
between  point  spacing  and  a  good  choice  of  c/oca/  •  The  best  shape  parameter 
is  generally  found  by  increasing  the  value  of  ciocal  until  just  prior  to  machine 
precision  breakdown. 

Let  Y  =  {2/1, . . . ,  2/„2}  be  the  set  of  points  on  the  n  x  n  regular  unit  grid. 
If  the  previous  local  approximation  technique  is  applied  to  each  t/j,  then  the 
irregular  data  can  be  transformed  to  a  regular  grid  with  approximate  function 
values  /j.  The  aim  is  to  find  a  global  approximation  using  the  new  data  at 
the  grid  points. 

After  converting  scattered  data  to  a  regular  grid,  certain  approximation 
techniques  become  available  which  would  otherwise  have  been  difficult  to  im¬ 
plement.  Polynomial  tensor  product  splines  can  be  efficiently  employed  to 
approximate  a  solution  from  the  given  gridded  data.  To  find  such  an  approxi¬ 
mation  at  a  point  z,  z  must  lie  inside  a  (d-f- 1)  x  (d-t- 1)  subgrid  of  the  regular 
points,  where  d  is  the  degree  of  the  Lagrange  polynomials  to  be  used^  Let 
the  points  of  such  a  subgrid  be  labelled  ^ij  and  have  function  values  fij  for 
i,j  =  l,...,d-|-l.  The  univariate  Lagrange  polynomials  Li{x)  and  IA{y)  are 
constructed  such  that 


=  and  L>{U)  =  Sl 

The  polynomial  tensor  product  spline  is  defined  to  be 


4,ij{z)  =  L\z)-Lj{z). 


The  approximation  at  z  is  given  by 

d+l  d+l 
i=l  j  =  l 

Alternatively,  a  thinned  global  interpolant  of  the  form  (1)  can  be  achieved  by 
solving 


Bfj,  =  f, 


(3) 


where  =  4>{\\yi  ~  yj II)  )  =  [Mi  •  • '  and  /  =  [/i  •  •  • 

This  amounts  to  finding  an  interpolant  s  to  a  thinned  approximation  of 
the  initial  data.  The  local  approximation  errors  \f{yi)  -  f{yi)\  limit  the  final 
accuracy  of  s. 

The  inverse  of  B  need  only  be  computed  once,  and  then  stored  for  future 
use.  All  scattered  data  problems  can  then  be  scaled  and  transformed  to  the 
regular  grid  Y,  whereupon  fi  is  given  by  the  matrix- vector  product  fi  =  B~^f. 
Only  half  of  the  entries  of  B~^  need  to  be  stored  since  B~^  == 
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Fig.  1.  An  example  of  approximation  vs.  residual. 

Too  large  a  matrix  B  causes  storage  problems,  and  difficulties  in  calculat¬ 
ing  the  inverse.  As  c  increases,  the  approximation  improves,  but  the  residual 
\\f  —  B/i\\oo  grows.  A  value  for  c  is  chosen  before  the  approximation  begins  to 
deteriorate  due  to  the  rise  in  the  residual.  As  an  example,  the  function  T  =1 
is  approximated  on  the  unit  square  using  inverse  multiquadrics  by  interpolat¬ 
ing  fi  =  f  =  1, ...  ,400  using  (3).  The  approximation  is  evaluated  at 

1000  random  points.  The  results  in  Figure  1  are  typical  for  smooth  functions, 
but  the  consequent  choice  of  c  is  only  a  guide,  and  does  not  guarantee  success 
for  all  :F. 


§3.  Hierarchical  Iterative  Refinement 

The  hierarchical  method  uses  increcisingly  dense  subsets  of  X  to  refine  the 
current  approximation;  see  [5].  Let  Xk  =  C  X,  such  that 

A^fc+i  >  Let  Sk  be  the  current  approximation,  and  be  the  full  global 
residual  at  the  level, 


=  f{xi)-Sk{xi). 

Let  rfc  be  the  residual  over  the  points  in  Xk-  This  thinned  global 
residual  is  interpolated  by 


Nk 

ik{x)  =  (4) 

i=l 

where  II)  =  (c^-t-||a;-rCi||2)-i/2  ^  and  tk{xi)  =  rk{xi)  ,  i  = 

The  initial  interpolant  si  =  0  is  updated  by 


5fc  +  l  =  Sk  +  tk. 


(5) 


Multi-Level  Approximation  to  Scattered  Data 


251 


The  technique  of  gridding  data  in  Section  2  is  used  to  find  an  approximate 
function  value  for  every  point  in  Yp.  Therefore,  (5)  is  replaced  by  Sk+i  — 
Sfc  +  5  where  Sk  is  the  current  approximation  to  regular  approximate  data, 

Si  =  0. 

The  value  ci  can  be  relatively  large  to  give  a  good  initial  approximation. 
As  Nk  increases,  Ck  has  to  be  reduced  to  ensure  computational  solvability.  The 
decrease  in  Ck  introduces  tighter  basis  functions  which  improve  the  resolution 
of  the  approximation. 

A  method  of  data  thinning  is  required  to  determine  the  points  in  Xfc-  The 
dense  systems  arising  from  (4)  have  to  be  solved  directly,  but  this  is  imprac¬ 
tical  for  large  Nk.  To  overcome  such  complications,  domain  decomposition  is 
applied  to  each  Xk- 

The  levels  of  the  hierarchy  have  to  be  computed  sequentially,  but  by 
using  domain  decomposition  each  sub-domain  can  be  dealt  with  in  parallel. 
Moreover,  each  such  solution  only  requires  a  single  matrix-vector  product. 

To  put  this  in  the  current  context,  each  Xk  is  constructed  from  overlapping 
square  grids  Yp  ,  where  p  =  1, . . .  ,mfc.  These  square  grids  need  not  be  the 
same  size  or  of  similar  orientation,  but  must  contain  an  equal  number  of  points. 
Each  sub-domain  Yp  consists  of  an  inner  region,  where  the  approximation  is 
finally  evaluated,  and  an  overlap.  Special  attention  has  to  be  given  to  sub- 
domains  whose  edges  coincide  with  the  boundary  of  X. 

At  the  level,  mk  sub-domain  interpolation  problems  need  to  be  solved. 
Since  B  is  invariant  under  shifts  and  rotations  of  the  centres  yi,  the  stored 
B~^  can  be  invoked.  If  the  centres  are  scaled  pi  api,  this  amounts  to  a 
change  in  the  shape  parameter. 

Recall  that  5i<ij<n  =  Hhi  -  VjW)^  where  pi  G  [0, 1]  x  [0, 1].  Now, 

(f>{\\y  -  ViW)  =  \\y  - 

==  a{a^c^  +  \\ay  -  ayi\\^)~'^^‘^ . 

Let  Wi  =  api  and  define  ip{\\w  —  iyi||)  =  a{a^c^  +  ||i<;  —  Then 

Bi<i,j<n  =  4>i\\yi-yj\\)  =  where  Wi  G  [0, a] x [0, a].  Therefore 

by  using  the  matrix  B,  a  new  inverse  multiquadric  is  created  at  scaled  points 
with  shape  parameter  ac. 

Each  of  the  thinned  global  interpolation  problems  (4)  can  be  decomposed 
and  solved  by  multiple  applications  of  the  stored  inverse  B~^.  Continued  use 
of  the  same  inverse  naturally  introduces  tighter  basis  functions  suitable  for 
approximating  typical  residuals. 

§4.  Numerical  Results 

We  give  an  example  where  the  above  scheme  is  used  to  approximate  Franke’s 
function  [2]  over  10000  scattered  points  in  the  unit  square  in  IR^: 

T{U,  V)  =  Q  j5g-(9u-2)V49-(9v-2)Vl0 
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Level 

No.  of 

domains 

rrik 

Shape 

parameter 

Ck 

Overlap 

Max,  error  in 

gridded  data 

ll/-/ilU 

Max.  error  in 

solution 

r-  41100 

1 

1 

0.25 

0 

8.241  X  10“^ 

1.371  X  10“^ 

2 

4 

0.138 

1/36 

2.424  X  10-® 

1.838  X  10-5 

3 

16 

0.0688 

1/72 

3.189  X  10-® 

3.189  X  10-® 

Tab.  1.  Error  for  Franke’s  function. 

The  localised  interpolation  problems  are  solved  directly  using  Gauss  Elim¬ 
ination,  with  q  —  2Q  and  c/oca/  =  0.2.  It  is  the  error  function  at  each  level 
which  is  approximated  locally,  and  not  the  original  function  T.  The  square 
sub-domain  grids  Yp  are  comprised  of  21  x  21  equally  spaced  points.  For  ease 
of  implementation,  the  sub-domains  used  for  a  particular  level  are  of  equal 
size.  The  overlaps  between  sub-domains  therefore  consist  of  one  or  two  mesh 
points,  depending  on  position.  The  key  interpolation  matrix  B  is  constructed 
from  inverse  multiquadrics  with  yi  G  [0, 1]^,  c  =  0.25,  and  is  generated  us¬ 
ing  Matlab.  The  domain  decomposition  is  straightforward  on  the  unit  square 
with  ruk  =  4*“^  The  thinned  global  interpolants  Sk  are  evaluated  at  points 
ti  G  [0,1]^.  Table  1  shows  the  error  in  the  approximated  data  at  the  regular 
grid  points  \  f{yi)  -  /(?/i)|,  and  the  error  in  the  approximation  \^{ti)  —  5jk(ti)|. 
Figures  2  and  3  show  the  approximation  error  for  each  level. 

The  error  function  from  Level  1  clearly  demonstrates  the  ability  of  the  in¬ 
verse  multiquadric  to  approximate  smooth  data.  The  error  near  the  boundary 
is  scaled  by  an  order  of  magnitude  at  each  level,  but  has  the  same  general  be¬ 
haviour.  The  final  iteration  leaves  error  near  the  boundary,  aggravated  by  test 
points  being  outside  the  original  scattered  data  set.  Such  evaluation  points 
ought  to  be  included  since,  although  they  require  the  extrapolation  of  Sk  to 
evaluate,  the  experiment  was  specified  to  be  conducted  on  the  unit  square. 
The  original  aim  of  finding  a  solution  where  the  residual  is  comparable  to  the 
approximation  accuracy  is  fulfilled  at  Level  3. 

Example  I  is  repeated  as  far  as  the  regularization  of  data,  and  then  poly¬ 
nomial  tensor  product  splines  are  used  to  find  the  final  approximation,  as 
described  in  Section  2.  Such  splines  cannot  replace  the  inverse  multiquadric 
approximation  on  the  regular  grid  without  an  increase  in  error.  Such  an  er¬ 
ror  is  then  propagated  to  the  next  level  where  the  discrepancy  is  amplified. 
However,  if  the  hierarchical  refinement  procedure  is  abandoned,  then  these 
basis  functions  efficiently  yield  a  good  approximation.  Table  2  shows  the 
approximation  accuracy  for  such  splines  of  different  polynomial  degree  with¬ 
out  iterative  refinement.  The  grid  sizes  are  comparable  to  those  used  in  the 
original  example. 
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Grid  Size 

Linear 

Quadratic 

Cubic 

21  X  21 

41  X  41 

81  X  81 

2.2  X  10-2 

5.3  X  10-3 

1.4  X  10-3 

6.4  X  10-3 

9  X  10-^ 

1  X  10-^ 

3.5  X  10-3 

2.6  X  10-^ 

5.7  X  10-3 

Tab.  2.  Approximation  error  for  the  various  splines. 


Fig.  2.  Approximation  Error  for  Levels  1  and  2. 


Fig.  3.  Approximation  Error  for  Level  3. 

§5.  Conclusion 

A  global  solution  to  an  interpolation  problem  involving  a  large  number  of 
data  points  is  too  expensive  to  compute  directly  if  inverse  multiquadrics  are 
to  be  used  effectively.  However,  if  the  aim  is  to  generate  approximations  to  a 
function,  then  such  a  solution  is  unnecessary,  and  an  alternative  method  has 
been  presented. 

The  underlying  idea  is  to  transform  given  scattered  data  fi  at  points  Xi  to 
regular  approximate  data  fi  at  i/i,  which  is  easier  to  solve  for.  The  aim  is  then 
no  longer  to  interpolate  the  initial  data,  but  to  find  a  good  approximation  to 
it.  The  final  solution  s  is  an  approximation  to  /,  which  is  close  to  /.  Success 
relies  on  minimising  the  local  approximation  errors  \f(yi)  —  f{yi)\> 

The  algorithm  is  0{N)  since  the  only  work  related  to  the  number  of  initial 
points  is  the  search  for  the  q  closest  points  to  each  yi.  Such  a  search  can  be 
improved  by  making  assumptions  as  to  which  Xi  are  unlikely  to  qualify. 
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The  time  required  to  solve  each  sub-domain  problem  is  reduced  due  to 
the  use  of  the  stored  n  x  n  key  inverse  matrix.  Solving  directly  would  be 
O(n^),  but  the  required  matrix-vector  product  is  0{n^). 

The  hierarchical  iterative  refinement  strategy  produces  good  approxima¬ 
tions,  and  is  the  only  sequential  aspect  of  the  method.  The  search  for  ap¬ 
proximate  regular  data,  and  the  solutions  for  each  decomposed  sub-domain 
are  parallelizable  operations,  although  this  has  yet  to  be  implemented.  These 
features  mean  that  large  data  sets  can  be  dealt  with  in  acceptable  computing 
time. 
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Best  Approximation  Algorithms: 
A  Unified  Approach 


V.  V.  Kovtunets 


Abstract.  A  generalization  of  the  Remez  algorithm  is  proposed.  The 
new  approach  uses  differential  properties  of  the  best  approximation  opera¬ 
tor.  The  method  was  developed  for  polynomial  approximation  of  complex- 
value  functions.  In  this  paper  the  convergence  of  algorithm  is  proved  for 
Banach  spaces. 


§1.  Introduction 

Let  us  consider  the  best  approximation  operator 


where  B  is  a  Banach  space  (complex  in  general),  Pn  an  n-dimensional  sub¬ 
space.  Suppose  that  P„  is  univalent  and  one-side  differentiable  in  any  direc¬ 
tion  [1].  This  assumption  is  valid  when: 

i)  B  =  C{Q,  R)  Q-compact,  and  Pn  is  a  Chebyshev  subspace  (in  particular, 
when  Pn  is  the  subspace  of  algebraic  polynomials  of  degree  less  or  equal 
to  n  —  1  [2,9]); 

ii)  B  =  C{Q,C),Q  is  finite  set,  and  Pn  is  an  n-dimensional  Chebyshev 
subspace  [3]; 

iii)  3  —  Lp,  p  >  1,  and  Pn  is  an  n-dimensional  subspace  (R.  Holmes  and 
B.  Kripke). 

Originally  the  differential  properties  of  the  best  approximation  operator  were 
applied  to  the  development  of  algorithms  in  [4,5].  The  goal  was  to  generalize 
the  Remez  algorithm  for  complex- valued  functions.  In  [6]  the  new  best  approx¬ 
imation  algorithm  was  applied  to  the  approximation  of  conformal  mappings 
by  polynomials.  In  [7]  it  was  shown  that  for  real  polynomial  approximation. 
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such  an  approach  generates  exactly  the  Remez  algorithm,  and  a  stronger  con¬ 
vergence  theorem  was  proven  using  differentiation  technique.  In  [8,1],  the 
new  approach  was  applied  successfully  to  nonlinear  approximation,  including 
rational  and  generalized  rational  uniform  approximations. 

Here  we  show  that  the  method  is  applicable  to  the  best  approximation 
from  a  finite-dimensional  subspace  in  the  arbitrary  Banach  space  if  the  best 
approximation  operator  is  one-sided  differentiable. 

§2.  Description  of  the  Algorithm 

Suppose  that  the  i-th  step  of  the  algorithm  is  performed  to  find  the  best 
approximation  of  the  element  /  G  B,  and  that  the  element  Pi  G  Pn  is  found. 
If  |[/  —  Pill  =  E{f),  then  the  process  is  finished.  Otherwise  the  inequality 

holds,  and  the  next  step  should  be  performed. 

In  order  to  construct  the  next  approximation  Pi+i,  we  construct  an  aux¬ 
iliary  element  Qi  such  that  the  equality 


ii/+ft-^’iii  =  ii/-Pili  (1) 

holds.  Suppose  the  following  assumptions  are  true: 

Assumption  1.  The  mapping  G  =  G{f)  :  P„  — ^  B,  which  defines  the  auxil¬ 
iary  element  gi  =  G{f,Pi)  is  continuous. 

Assumption  2.  For  all  functionals  a:  G  B*  with  properties  \x{f  —  Pi)|  = 
11/  —  Pill,  ||a;||B>-  =  1,  equality  x{gi)  =  0  holds.  Moreover,  for  every  such 
extremal  functional  x,  a  weak  neighbourhood  V{x)  C  B*  exists  such  that 

Rey{f  -  Pi)y{gi)  >0,  Vy  G  V{x)  H  {z  G  B^  ||z||  =  1}. 

Assumption  3.  For  given  fixed  f  the  mapping  D  =  D{f)  :  B  ^  P„,  which 
defines  the  derivative  Di  =  D{f,gi),  is  continuous. 

These  assumptions  may  be  satisfied  easily  for  real  and  complex  uniform 
approximations  [4,5,7].  When  the  derivative 

dP{f  +  {l-t)g,) 

t=+0 

is  calculated  (usually  as  the  solution  of  system  of  linear  algebraic  equations), 
the  next  element  Pi 4.1  is  computed  as 

Pi+i -Pi  +  fiA,  (2) 

where  cvi  <  U  <  Ti,  0  <  c  =  const  <  1,  and  Ti  is  the  minimal  value  of  t,  for 
which 


Eiin)  =  mm{Ei{t)  ==  [[/  -  P,  -  tA||,0  <  t  <  1}. 
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§3.  Main  Theorem 

Theorem  1.  For  given  f  e  B\Pn,  the  sequence  constructed  accord¬ 

ing  to  the  general  scheme  (see  the  previous  section)  converges  to  the  element 
P(f)  of  best  approximation  of  /  G  B. 

Proof;  Let  us  write 

^i  =  Ilf -Pill, 

m  =  llf  +  (^-t)gi-P(f  +  (i-t)gi)ll 

Ei(t)  =  llf  +  (l-t)gi-Pi-tDil  (3) 

Bi(t)  =  \\f-Pi~tDil 
Cii  =  A-(+0)j 

E*  =  mm{Ei(t),0  <t<l}  =  Eiin),  i  >  0. 

The  convexity  of  the  function  Ei  (t)  implies 

E'C+O)  <  Eiil)  ~  Ei(0)  =  E(f)  -  E{f  +  gi)  =  E{f)  -  \\f  -  P,||  <  0. 
Since 


\Ei{t)  -  Ei{t)\  <  ||P(/  +  (1  -  t)gi)  -Pi-  tDiW  =  o(t),  t  ^  +0, 


the  equality 

E'ii+O)  =  E’{+0) 

holds.  Therefore, 

£'(+0)  <  £(/)  -  £i  <  0. 

(4) 

Now  we  show  that  there  exists  e  >  0  such  that 

^‘(')-®‘(‘)+ll/-P.|| 

(5) 

for  all  0  <  t  <  e.  Suppose  that  the  last  statement  is  invalid-  Then  there  is  a 
sequence  {t/},  ti  — ^  +0,  I  — >  oo,  such  that 


11/  -Pi-  tiDiW  >  11/  +  (1  -  t,}gi  -Pi-  UDiW  +  I  >  1- 

Choosing  a  subsequence  if  necessary,  we  may  consider  the  weakly  conver¬ 
gent  sequence  of  functionals  {a:j}  C  B*,  V/  \\xi\\  =  1,  such  that 

Mf  -Pi-  t,Di)\  >  11/  +  (1  -  t,)gi  -Pi-  t,Di)\\  +  *  >  1-  (6) 

Let  xq  =  limi_^oo  Then  inequality  (6)  implies 
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and  since  ||a:o||  =  1,  we  finally  have 

Hence  in  accordance  with  Assumption  2 

=  0.  (7) 

Since  ||a:;||B*  =  Ij  it  follows  from  (6)  that 

\xi(f  -Pi-  tiDi)\  >  \x,(f  +  (1  -  t,)gi  -  P.  -  t,Di))\  +  (6') 

for  I  >  1. 

Now  we  temporarily  write 

a  =  xi{f  -Pi-  tiDi), 
b  =  {l-ti)xi{gi), 

_  2tf\xi{Di)\'^ 

ii/-p<ii 

Using  this  notation,  the  inequality  (6')  may  be  rewritten  in  the  form 

\a\  >  |a  +  6|  +  5, 

which  implies  |a|  >  \a  +  b\.  Consequently, 

|al^  >  \a-\-  +  2|a  +  b\s  >  |a  +  6^  +  2|o|s. 


Thus, 

lap  +  |6p  +  2Rea6  <  |ap  -  2|a|s, 

and 

2Rea6<-|6p-2|al5.  (8) 

Now  we  substitute  the  values  of  a,  6,  s  in  (8)  and  obtain 

2Re2:/(/  -Pi-  tiDi)xi(gi){l  -  U) 

<  -(1  -  U)\i(gi)\'^  -  i\xiU  -P,- 


Since  t,  +0,  xi{f  -Pi-  ijA)  ->  \\f  -  P|l,  when  I  oo,  there  exists 
number  /q  such  that  inequalities 
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2Re xi{f  -  Pi)xi{gi) 

<  -(1  -  +  2tiRexi{Di)xi{gi)  - 


4tf\xi(f-Pi-tiDi)\\xi{Di)\‘^ 


{l-ti)\\f-Pi\\ 


<  -- 


+  2tiRexi{Di)xi{gi)  -  2tf\xi{Di)\^ 


=  ~-\xi{gi)  +  2xi{Di)\'^  <0 

are  valid  for  all  I  >  Iq.  Therefore, 


Rexi{f  -  Pi)xi{gi)  <0,  yi  >  Iq. 


But  taking  into  consideration  (7),  we  see  that  this  inequality  contradicts  As¬ 
sumption  2,  so  (5)  is  proven. 

Since  Ei{0)  =  Ei{0)  =  Ei{0),  (5)  implies 

E'{+0)\  <  Eii+0)  <  E{f)  -  Ei.  (4') 


Therefore,  in  accordance  with  (2), 


Ei{Ti)  =  min{\\f  -  Pi  -  tA||,0  <t  <1}  <  Ei  and  >  0. 


So 

Ei+i  <  Ei. 

Hence  the  sequence  converges  to  some  value  E^  >  E{f),  i.e., 

lim  Ei  =  E^. 

From  (2),  (4')  and  convexity  of  Ei(t),  it  follows  that 

Ei  -  Ei(ri) 


Ei+i  <  Ei  ~ 


n 


-U  <Ei-  c{Ei  -  Ei{Ti)). 


Consequently, 


AEi  =  Ei+i  -Ei>  c{Ei  -  Eiin)). 

Since  AEi  0,  also  Ei  —  Ei{Ti)  — >  0,  and 

limEi(ri)  =  limEi  =  E^. 


(9) 


(9') 


To  complete  the  proof  we  must  show  that 

E.  =  E{f). 


260 


V.  V.  Kovtunets 


Suppose  that  this  statement  isn’t  valid,  i.e., 

E.  >  E(f).  (10) 

Since  the  subspace  Pn  is  finite-dimensional  and  Assumptions  1  and  3  hold,  the 
subsequence  =  {P*:}  may  be  chosen  so  that  the  following  limits  exist: 

i)  limPA:  =  P*; 

ii)  limpi,,  =  p*; 

iii)  lim aij^.  =  a*; 

iv)  limr^,^  =  r*. 

Prom  Assumption  3  it  follows  that  also 

limPj^^  = 


exists,  and 

limll/  -  Pi,  -  tDi,\\  =  11/  -  P.-tD,\\  (11) 

uniformly  for  t  e  [0,1].  Equalities  (9)  and  (9’)  imply  that  at  least  one  of 
following  statements 

a*  =  lim  ttfc .  =  0;  (12) 

or 

r*  =  limrjk.  =0  (13) 

is  valid. 

Using  the  assumption  (10)  and  the  scheme  of  the  algorithm,  we  construct 
the  auxiliary  element  p  /  0  for  the  approximation  P* .  Due  to  Assumption  1 , 
we  have 

9*  =9‘ 

For  the  following  two  convex  functions 


£.(<)  =  11/  +  (1  - 1)9.  -  p(/  +  (1  -  t)g.)\\, 


where 


dPjf  +  (1  -  t)g,) 
dt 


U=-i-o, 


analogously  to  (4'),  we  obtain  inequalities 


(14) 


Ki+O)  =  £:(+0)  <  E{f)  -E.<0 
and  f  >  0,  where  r  is  a  minimal  value  of  t,  for  which 
E.(f)  =  min{E.(t),  0  <  t  <  1}. 


Best  Approximation  Algorithms 


261 


From  (11),  it  follows  that  r*  =  f  and  therefore  (13)  is  impossible.  So  (12)  is 
true.  But  (14)  implies  that  there  is  an  integer  ko  such,  that  for  every  k,k  >  ko, 
the  inequality 


< 


2 


<0 


is  valid,  and  therefore 


a*  <  0. 


This  inequality  contradicts  (12).  Hence  assumption  (10)  is  invalid.  The  theo¬ 
rem  is  proven.  □ 


§3.  Applications 

As  was  mentioned  above,  the  proposed  algorithm  may  be  considered  as 
a  wide  generalization  of  the  classical  Remez  algorithm.  Applied  to  polyno¬ 
mial  approximations  of  complex-valued  functions,  the  method  generates  an 
algorithm  which  possesses  in  general  a  linear  convergence  as  numerical  exper¬ 
iments  show  (see  also  [6]).  For  finite  sets,  the  convergence  of  the  algorithm  is 
quadratical. 

When  applied  to  real  polynomial  approximations,  the  method  generates 
exactly  the  Remez  algorithm  [7].  But  even  in  this  case  an  approach  which 
uses  differential  properties  of  the  best  approximation  operator  allows  better 
estimations  of  convergence. 

Theorem  2.  ([7]).  Let  Pn  be  a  n-dimensional  Chebysbev  subspace  in  C[a,  b], 
and  let  A^('a,  x)  be  the  second  difference  of  the  function  u  at  the  point  x  with 
step  h.  If  the  function  f  6  C[a,  6]\Pn  has  the  best  approximation  P(/)  G  Pn 
such  that  the  difference  f  -  P(/)  possesses  exactly  n  -H  1  extremal  points 
Xo,xi, . . .  ,Xn  and  the  inequality 

\Al(f  -P(f),xj)\>jh^,  j  =  const,  j  =  0,l,...,n 

holds  in  points  xj,  then  the  Remez  algorithm  for  f  converges  quadratically. 

In  [7]  a  modified  Remez  algorithm  for  twice  continuously  differentiable 
functions  is  proposed.  A  procedure  for  extremal  points  calculation,  using 
differential  properties,  is  developed  to  reduce  the  complexity  of  the  most  com¬ 
plicated  part  of  the  Remez  algorithm. 

This  method  may  be  applied  to  the  best  polynomial  Lp—  approximation. 
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On  Curve  Interpolation  in 


Jernej  Kozak  and  Emil  Zagar 


Abstract.  In  this  paper  the  interpolation  by  continuous  spline  curves 
of  degree  n  in  is  studied.  There  are  r  interior  and  two  boundary  data 
points  interpolated  on  each  segment  of  the  spline  curve.  The  general  form 
of  the  spline  curve,  as  well  as  the  defining  system  of  nonlinear  equations 
are  derived.  The  asymptotic  existence  of  the  solution,  and  the  approxi¬ 
mation  order  are  studied  for  the  polynomial  case  only.  It  is  shown  that 
the  optimal  approximation  order  is  achieved,  and  asymptotic  existence  is 
established  provided  the  relation  r  =  n  —  2  is  satisfied.  These  conclusions 
hold  independently  of  d.  It  is  also  pointed  out  that  the  underlying  analysis 
could  not  be  carried  over  to  the  case  r  =  n  —  1. 


§1.  Introduction 

The  interpolation  problem  considered  is  the  following.  Let  the  points 

To,Ti,.. .  ,Tm  Tj  all  j,  d>2,  (1) 

and  the  tangent  directions 

do,dN  (2) 

at  the  boundary  points  be  given.  Find  a  continuous  spline  curve  Bn  of 
degree  n  which  interpolates  the  prescribed  data. 

The  problem  appeared  first  as  a  particular  limit  case  in  [2],  and  was 
further  generalized  in  several  papers,  among  them  in  [3-5,6,9-10].  A  general 
approach  to  the  approximation  order  achieved  can  be  found  in  [8] . 

Here,  the  general  setup  is  tackled.  The  interpolating  spline  curve  in  the 
Lagrange  form  is  established  and  the  defining  system  of  nonlinear  equations 
is  derived  in  general.  However,  the  asymptotic  existence  of  the  solution  (i.e. 
the  existence  of  the  solution  when  given  points  are  sampled  densely  enough) 
and  the  approximation  order  turned  out  too  comprehensive  to  be  studied  here 
in  a  general  framework.  The  positive  conclusions  for  the  single  segment  case 
when  r  =  n  ~  2  are  established.  It  is  possible  to  extend  these  results  to 
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the  m-segment  spline  curve,  but  the  proofs  are  not  short,  and  will  appear 
elsewhere.  On  the  contrary,  as  one  could  guess  from  [8],  the  case  r  =  n  -  1  is 
not  encouraging. 

Why  would  one  use  the  G^-continuous  splines  as  interpolating  curves? 
Quite  clearly,  the  derivative  continuity  at  the  breakpoints  becomes  in  this 
way  independent  of  the  local  parametrisation.  Also,  these  curves  could  be 
seen  as  a  generalization  of  the  odd  order  spline  function  interpolation  at  knots, 
applied  so  successfully  in  many  cases.  The  order  of  G-continuity  2  is  pinned 
down  by  the  human  eye,  sometimes  quite  important  in  CAGD;  it  can  detect 
the  continuity,  the  continuity  of  the  tangent  direction  and  the  curvature,  but 
hardly  higher  order  geometric  quantities. 

Throughout  the  paper  bold  faced  letters  will  stand  for  vectors,  and  or¬ 
dinary  ones  for  scalars.  The  dot  product  on  will  be  denoted  by  •  and  its 
implied  norm  by  H-H.  Derivatives  with  respect  to  the  global  (or  local)  pa¬ 
rameter  will  be  denoted  by  *  (or  d/dC)>  and  those  with  respect  to  the  natural 
parameter  by  '. 

Now  let  Bn  be  a  continuous  spline  curve  of  degree  n  with  m  segments 

corresponding  to  the  breakpoints 

Co  ^  Cl  <  •  •  •  <  Cm  • 

We  suppose  that  the  pieces  are  locally  parametrized  on  [0, 1]  as 

B(C)  =  B'(^^^).  Ce  [0-1,0]. 

Suppose  B  interpolates  the  data  (1),  and  (2).  If  r  interior  and  two  boundary 
points  are  to  be  met  on  each  segment,  then  N  =  m(r  +  1).  Further,  on  the 
^-th  segment  the  interpolation  conditions  read 

B*(0,j)  =  T;j- ;=T(,_i)(,.+i)+j,  j  =  0,l,...,r+l,  £  =  1, 2, . . ,  ,ra,  (3) 

where 

0  =:  te^o  <  ti^i  <  •  •  •  <  '=  1? 

and  are  the  unknown  parameters  to  be  determined.  Let  x  Ay  ^ 

{xiyj  -  Xjyi)i<:j  denote  the  2-wedge  product.  The  geometric  continuity  of  B 
requires  the  tangent  direction 

-^B  (4) 

l|B|| 

-1— B  aB 

IIBIP 


as  well  as  the  curvature 


(5) 
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de 


Fig.  1.  An  interpolating  spline  curve  with  three  segments  . 


to  be  continuous  at  the  breakpoints.  Additionally,  at  the  boundary  points  the 
tangent  directions  do  and  djv  have  to  be  interpolated  too,  i.e., 

do  A  .B(Co)  =  B{(m)  A  div  =  0.  (6) 


Fig.  1  gives  an  example  of  such  an  interpolating  spline  curve  for  r  =  1,  n  =  3, 
and  d  =  3.  A  brief  look  at  the  conditions  (3)-(6)  reveals  that  the  number  of  in¬ 
dependent  equations  would  be  equal  to  the  number  of  independent  unknowns 
if 

dn-(d-l)r  =  3d-2.  (7) 


As  already  observed  in  [5],  for  fixed  d  this  Diophantine  equation  always  has 
an  infinite  number  of  nonnegative  solutions.  The  following  lemma  gives  its 
general  solution. 

Lemma  1.  The  possible  choices  of  pairs  r  and  n  that  satisfy  (7)  for  fixed  d 
are  given  by 


r  =  d  — 2  +  dfc,  n  =  d-\- {d  —  l)k,  A:  =  0, 1, -  (8) 

Proof:  The  relation  (7)  can  be  rewritten  as 

d(n  -  d)  -  (d  -  l)(r  -  d  +  2)  =  0. 

Since  d  >  2,  the  numbers  d  and  d  —  1  are  relatively  prime.  So  d  must  divide 
r  —  d  2,  and  d  —  1  must  divide  n  —  dj  i.e., 

r-d  +  2_7i-d_, 

d  ~d-l~ 

for  an  integer  k.  But  r  =  d-  2  +  dk>0  implies  k  >  ^  -  1  >  -1,  and  the 
conclusion  follows.  □ 
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§2.  The  Defining  Equations 

Several  approaches  were  used  to  simplify  the  conditions  (3)-(6)  for  particular 
choices  of  d,  n,  and  r.  Here  we  show  that  this  can  be  done  in  general,  which 
will  provide  an  opportunity  to  unify  the  computer  programs.  Let  us  consider 
a  single  segment  first.  In  this  case,  the  data  to  be  interpolated  are  the  points 
Tj+i,  as  well  as  tangent  directions  do)  ^r+i  at  the 
boundary  points.  Suppose  r  and  n  are  given  by  (8).  Consider  the  case  n  =  r+2 
first,  i.e.,  /c  =  0.  The  interpolating  polynomial  curve  can  be  written  explicitly 
in  Lagrange  form  as 

r+l 

B  1=  bu)  ^  ^  T  jjCj 
j=o 

with 

(9) 

tj  :=  hj,  and  the  values  to  be  determined.  Here  b  denotes  the 

unknown  leading  coefficient  vector.  If  /c  >  1,  one  has 

r  +  2  =  d{k  +  1)  >  d{k  +  1)  -  k  =  n, 

and  B  is  of  degree  at  most  r  +  l,  i.e., 

r  +  l 

B  =  Y,TiCj. 

j=0 

In  particular,  this  imposes  additional  conditions 

r+l 

degree  ^^TjCj  <n  (10) 

for  fc  >  1.  An  easy  way  to  meet  the  tangent  direction  conditions  (6)  is  to 
introduce  two  additional  (strictly  positive)  real  unknowns,  ao  and  a^+i ,  and 
require 

.H(io)  Qodoj  B{tr-\-l)  —  <ar+idr+i-  (1^) 

Let 

T_i  =  To  •—  to»  '^j  ^ji  J  —  Ij  2,  .  .  .  ,  r,  Tr^2  —  "^r+l  •—  ^r  +  1  •  (^2) 

Since  B  is  a  polynomial  of  degree  <  n,  the  divided  difference,  based  upon 

n+2=r+4-fc 

points  maps  it  to  zero.  So  the  conditions  (11)  and  (10)  can  be  written  in  a 
compact  form  as 

[■^j  — 1  j  +;+r+2  — =  0,  J  =  0,  1,  .  .  .  ,  A;, 


(13) 
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which  is  a  system  of  d{k  +  1)  nonlinear  equations  for  r  +  2  =  d{k  +  1)  scalar 
unknowns 

«05^1j^25  •  •  ■  j^rjCKr+l-  (14) 

In  the  case  n  =  r  +  2,  one  has  to  determine  additionally  the  coefficient  vector 
b,  for  example  as 


b  —  \tO’)tQ,ti,  .  ,  .  jtn  —  [tOj  •  ■  •  >  ^r+1  j  tr+l]^  •  (1^) 

Now,  for  an  m-segment  spline  curve,  the  directions  ^  =  1, 2, . . . ,  m  —  1,  are 

unknown,  as  well  as 

Oi£,0,ti^l,t£^2^-’ ■  ,U,r,OCi^r+l,  ^=1,2,  (16) 

But  one  can  still  write  the  interpolation  conditions  on  the  ^-th  segment  as 

—  0}  j  0,  1,  .  .  .  ,  fc,  (1^) 

where  rgj  are  defined  as  in  (12),  but  this  time  for  the  composite  case.  In 
addition,  the  missing  [d  —  l)(m  —  1)  equations  are  supplied  by  the  continuity 
conditions  of  the  curvature  (5). 

§3.  Asymptotic  Existence  and  Approximation  Order 

The  system  of  equations  based  on  (17)  and  continuity  of  curvature  (5)  is  non¬ 
linear,  and  one  of  the  approaches  to  study  it  is  to  assume  that  the  data  (1)  and 
(2)  are  based  upon  a  smooth  underlying  regular  parametric  curve  /  :  /  — >• 
parametrized  by  the  arclength  s.  The  local  expansion  of  the  curve  /,  and  the 
data  Ti^j  (sampled  densely  enough),  give  rise  to  an  asymptotic  analysis  of  the 
nonlinear  system.  The  simplest  way  to  obtain  the  local  expansion  is  to  use  the 
Prenet  frame  as  the  local  coordinate  system,  and  the  Frenet-Serret  formulae 
to  obtain  this  expansion.  Let  (et(s))f_i  denote  the  Prenet  frame,  with 

/'  =  ei.  (18) 


The  Frenet-Serret  formulae  read 
ei(s)  =  ^1(5)62(5), 

ei{s)  = -Ki^i{s)ei^i{s) Ki{s)ei+i{s),  i  =  2, 3, . . . ,  d  -  1,  (19) 

ed(s)  =  -Kd-i(s)ed-i(s), 

where  Ki  are  first  d  —  1  principal  curvatures  of  /,  expanded  as 

«i(5)  =  Kio  +  -h  +  •  •  •  (20) 

Since  /  is  a  regular  curve,  Kio  >  const  >0,  i  =  1,2, . . .  ,d  —  2.  We  will 
additionally  assume  that  ^  const  >  0.  Beginning  with  (18),  the  higher 
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derivatives  of  /  can  be  computed  by  (19)  and  (20).  This  produces  the  required 
expansion 

/W  =  /(o)  +  /'(o).  +  ^/"(o)s=  +  -'- 

=  /(0)  +  (s- +  (21) 

+  (^Kl.QS^  +  - )^2(0)  +  (-Kl,0«2,0S^  H - )e3(0)  H - 

Let  us  now  consider  the  single  segment  case  of  the  interpolation  problem 
with  data  based  on  a  smooth  /  :  [0,  h]  — > 

rfo  =  flVok),  Tj  =  f{r}jh),  i  =  0, 1, . . . ,  r  +  1,  rfr+i  =  f 

with  points  separated  independently  oih,  i.e., 

0  =:  7/0  <  m  <  •  •  •  <  <  7/r+i  :=  L 

Since  translation  and  rotation  do  not  influence  the  asymptotic  analysis,  we 
may  assume  /(O)  =  0,  and 

ei(0)  =  fc),ti,  i  =  l,2,...,d.  (22) 

Then,  with  the  help  of  (21),  one  obtains 

n-mh)  =  ( n  N.O  +  ,  (23) 

and  a  similar  expression  for  f'{h).  Since  the  divided  difference  is  a  linear 
functional,  we  can  normalize  the  system  (13)  by  multiplying  the  data  values 
by  D~^,  with 

D  :=  diag  I  Tj  h*  JJ 

\  '  9=1 

Let  f{s)  :=  {s^)f-i  denote  the  leading  part  of  the  normalized  /.  Then 

i^O)  ^05  (^1  j  ■  •  •  j  ?  ^r+l  >  ^r+l]-^  —  [^Oi  ^0)  ^1  ?  •  •  *  5  j  ^r+l?  ^r+l]^  +  ^(^)> 

(24) 

and  B  is  a  polynomial  of  degree  <  n  =  r  +  2  that  satisfies  the  interpolation 
conditions 

B{tj)  =  ajf{T}j),  i  =  0,r  +  l, 

B{tj)  ^f(r]j),  i  =  0,l,...,r,r  +  l, 

a„:=-,  «r+i:=-^. 


where 


269 


On  Curve  Interpolation  in 

Note  that  all  the  components  of  /  are  polynomials  of  degree  <  d  =  r +  2.  This 
implies  that  _ 

[T]o,r)o,7}i,...,r)r,Vr+l,Vr+l]7  =  0,  (25) 

and  the  solution  of  (24)  in  the  limit  /i  0  now  reads  as 

t  :=  (Oq,  £2  J  •  •  •  J  ~  (^5  J  ^2  J  •  •  *  )  ^rj  1)'  (26) 

To  prove  the  existence  of  the  solution  for  h  small  enough,  it  is  sufficient  to 
show  that  the  Jacobian  of  the  system  (24)  is  nonsingular  at  the  limit  (26). 
The  Jacobian  will  be  determined  with  the  help  of  the  following  fact:  if  xj  is 
different  from  all  the  other  points  xi,  and  if  a  function  g  is  smooth  enough, 
one  has 


g'(xj) 
n(®j  -*i) 

i^j 


l..,Xj,Xj,...]g  - 


n(^i 


(27) 


Consider  now  B  =  (B  — /)  +/.  Since  /  =  0  at  all  its  partial  derivatives 
with  respect  to  tj  vanish,  and  this  difference  contributes  to  the  Jacobian  at 
the  limit  point  t*  only  in  the  first  and  last  column,  i.e., 


d 


o  [^0)  ^Oj  •  •  ■  )  ^r+l>  ^r+l]  (-S  ^)l  *  — 


dao 

d 


t*  (V0-Vr+l)^'{V0) 

1 


?  (^0), 


/)|^,  -  (*7r+l), 


where  u)  is  given  by  (9),  and 


(28) 


uj  :=  c>j\ 


The  polynomial  curve  /  does  not  depend  on  So,  cfr+i,  and  from  (27)  and  (25) 
one  obtains  the  columns  2, 3, . . . ,  r  +  1  with  j  =  1, 2, . . . ,  r  as 


( dt  ^1)  •  •  •  >  >  ^r+l  J  ^r+l]^  f\t*  ~  [g  ■  — 


Vo){Vj  -  r]r+i)^'{tj) 


/(%■)• 


It  is  now  straightforward  to  see  that  the  Jacobian  at  t*  is  the  Vandermonde 
matrix  V{go,gi^ . . .  ,gr+i),  multiplied  by  Di  :=  diag(z)f_i  from  the  left,  and 
by 


£>2  :=  diag  (  - 


’  ??l(l  -  ’  ’?r(l  -  Vr)^'{Vr)  ’  ^'{Vr+l) 


from  the  right.  This  prepares  the  proof  of  the  following  theorem. 
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Theorem  2.  The  system  (13)  has  a  unique  solution  for  h  small  enough.  The 
approximation  order  of  the  resulting  interpolating  polynomial  curve  Bn  is 
optimal,  i.e.,  r  +  4  =  n  +  2. 

Proof:  Since  the  matrices  V{t]o,t)i,  . . .  ,7]r+i),  Di  and  D2  are  nonsingular, 
the  Jacobian  at  the  limit  point  t*  is  nonsingular,  too,  and  the  existence  of  a 
unique  solution  for  h  small  enough  is  established.  Furthermore,  the  unknown 
parameters  are  of  the  form 

ao  =  Or+i  =  h  +  tj  =  T]j  +  0{h),  j  =  l,2,  (29) 

Since  there  are  r  +  2  points,  as  well  as  two  directions  interpolated,  the  optimal 
approximation  order  is  quite  clearly  <  r  +  4.  The  proof  will  now  follow  the 
approach  applied  in  [2],  and  extended  in  [5].  It  is  based  on  a  reparametrisation 
that  transforms  the  direction  interpolation  to  the  derivative  interpolation,  and 
gives  an  estimate  of  the  parametric  approximation  order  as  defined  in  [7]. 
Recall  (22),  and  the  fact  that  interpolation  conditions  are  satisfied.  By  [2] 
and  [5],  it  is  now  enough  to  confirm  that  all  the  components  of  /  and  B  can 
be  reparametrized  by  the  ordinate  of  the  first  component  of  both  curves.  As 
to  /,  for  h  small  enough  this  fact  is  obvious.  The  first  component  behaves 
by  (21)  as  s  +  O(s^),  and  the  others  at  least  as  (9(s^).  To  establish  the  same 
conclusion  for  B,  it  is  enough  to  show  that 

B  =  ch{6u)U  +  0{h%  cjtO.  (30) 

Further,  the  optimal  approximation  order  proof  depends  on  the  additional 
relations 

B^^'^  =  Oih^),  g  =  2,3,..,,r  +  2.  (31) 

The  result  required  then  follows  from  the  standard  error  estimate  of  interpo¬ 
lation,  and  the  fact  that  the  (r-f4)-th  derivative  of  B  with  respect  to  the  new 
parameter  is  bounded  independently  of  h.  Let  us  verify  the  relations  (30)  and 
(31).  Recall  first 

t"  =  5[  =  0,l,...,r  +  l,  =  (32) 

j=o  i=o 

The  divided  difference  maps  polynomials  of  degree  < 

r  +  l  =  d— Ito  zero,  and  depends  continuously  on  its  arguments  if  applied  to 
a  smooth  function.  Thus  b  by  (15)  and  (23)  near  the  limit  point  t*  behaves 
like 

b={0{h'‘),0{h‘‘)....,0{h‘‘),xdh'‘  +  0{h<‘+^))  , 

where  Xi  =  11^=0  N.o  >  0-  On  the  other  hand,  (29)  and  (32)  imply  that 

r+l  r+l 

j=0  j=0 

=  (xi  h i, . . . ,  xd-1  h'‘-^  t"-' ,  Xd  h''  («"  -  a;(t)))"'  (1  +  0{h)). 
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r+l 

■£,(<)<«>,  9  =  1,2, . . .  ,r  +  2, 

j=0 

and  (31)  follows.  The  proof  is  complete.  □ 

There  is  no  hope  that  this  approach  could  be  used  for  all  k.  In  fact,  it 
fails  already  for  fc  =  1,  as  we  will  show  now.  By  (13),  the  equation  (24)  is 
replaced  by 

■■■  ytr,  tr+i\B  +  0{h), 

[to,ti,  .  .  .  ,tr,tr+l,tr+l]D~^B  =  [to,ti,  .  .  .  ,tr,tr+l,tr+l]B  +  0{h). 

Further,  as  in  the  proof  of  Theorem  2,  the  first  column  of  the  Jacobian  is 
determined  from 

—  [to,to,tl,...,lr,ir+ll(B-7)|^,  =  -7^7  (90), 

d  ^  ^ 

.  .  >  ,tr,tr+l,tr+l]{B  -  f)  =  0, 

the  last  column  from 

[^Oj  ^1)  •  •  •  5  )  ^r+l]  (-®  ~  1^*  ~ 

and  the  other  columns  from 

(^[l0,i0,<l,...,<r,«r+l])  7|C  =  ~(r,j-1]o)w'{tj)^ 

( Jjllo.ll,  •  ■  .  ,  tr,tr+l,<r+l])  7|c  =  -  -  Ttr+lWitj)^ 

After  normalizing  the  Jacobian  from  the  left  by  and  by  from  the 
right  one  obtains  the  matrix  A  :=  (aij)?j=i  with 

^i,l  —  2  =  1,2,...,  2(i, 

^i,2d  ~  0)  0.i+d,2d  —  2  =  1,  2,  .  .  .  ,  d, 

and 

ai,j  =  -  7]^Zv  ^i+dj  =  Ij-i,  2  =  1, 2, . . . ,  d,  j  =  2, 3, . . . ,  2d  -  1. 

A  simple  rank  preserving  transformation 

Uj jj  I  ^  5  ^  —  2d, 2d  l,...,d”f'  1,  j  —  Ij^,..., 2d, 

transforms  A  to  a  matrix  with  row  i  equal  to  row  2  +  d  for  2  =  2, 3, . . . ,  d.  It 
is  now  easy  to  see  that  the  rank  of  the  matrix  A  is  d  +  1,  and  consequently 
dim  ker  A  =  d  —  1.  Thus,  since  the  Jacobian  is  singular,  some  other  approach 
such  as  [1],  pp.  154-155,  should  be  applied  to  carry  out  the  asymptotic  anal¬ 
ysis. 
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Interpolating  Involute  Curves 

Mitsuru  Kuroda  and  Shinji  Mukai 


Abstract.  We  propose  a  straightforward  method  for  designing  an  in¬ 
terpolating  involute  curve  whose  radius  of  curvature  is  piecewise  linear  or 
quadratic  with  respect  to  winding  angle.  Designers  can  specify  and  con¬ 
trol  the  curvature  radius  profile  to  a  certain  extent.  End  radii  of  a  circle 
involute  are  solved  in  terms  of  end  tangent  angles,  and  a  involute  curve 
is  derived  by  the  Hermite  interpolation.  For  and  G^  involute  curves, 
relevant  nonlinear  equations  are  solved  by  the  Newton-Raphson  method. 
NC  machines  with  an  involute  generator  can  draw  the  resulting  curves 
with  “reduced  data”. 


§1.  Introduction 

We  present  a  new  method  for  designing  two  kinds  of  smooth  interpolating 
curves,  smooth  in  the  sense  of  consisting  of  less  segments  with  continuous 
monotone  curvature  radius  plot.  In  the  method,  we  derive  a  (curvature 
continuous)  involute  of  circular  arcs  or  a  involute  of  circle  involute  arcs 
through  describing  its  radius  of  curvature  that  is  piecewise  linear  or  quadratic 
with  respect  to  winding  angle. 

“The  most  important  curve  in  engineering  is  arguably  the  circle  involute 
...  it  has  played  key  historical  roles  in  a  variety  of  scientific  and  technological 
applications”  [3].  This  curve  has  excellent  shape  properties  which  make  it 
interesting  for  CAGD  (Computer  Aided  Geometric  Design),  One  can  draw  the 
involute  curve  manually  with  simple  equipment  if  necessary.  NC  (Numerical 
Control)  machines  with  an  involute  generator  are  available  [1,5]. 

In  our  straightforward  design  method,  designers  can  specify  tangents  and 
curvatures  at  junction  points,  and  control  the  curvature  profile  directly  to  a 
certain  extent.  End  radii  of  a  circle  involute  are  solved  in  terms  of  the  end 
tangent  angles,  and  so  an  interpolating  involute  curve  is  derived  span  by 
span  by  the  two-point  Hermite  interpolation.  For  and  involute  curves, 
continuity  conditions  and  other  requirements  lead  to  a  system  of  nonlinear 
equations.  We  solve  this  equation  system  by  the  Newton-Raphson  method, 
using  initial  values  from  the  conventional  (7^  cubic  spline  curve.  We  obtain 
examples  of  the  curves  satisfying  additional  requirements,  and  illustrate  the 
properties  of  the  newly  developed  curves. 
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Fig.  1.  Circle  involute  and  its  evolute. 


§2.  Circle  Involute  Arc 

A  planar  curve  r(s)  is  expressed  as 

r(s)=Po  +  ^  -oo<0<oo,  (1) 

where  s  is  arclength  from  the  starting  point  Po  0  is  a  winding  angle  (the 
angle  between  tangent  vector  and  the  direction  of  the  x  axis).  The  following 
relations  hold  among  the  curve  r,  unit  tangent  vector  t,  unit  normal  vector  n 
and  radius  of  curvature  p: 


_  1 

ds  ds  p 


(2) 


The  radius  of  curvature  of  circle  involute  is  proportional  with  respect  to  6\ 


P  = 


ds 

dO 


=  Pa-\-  -  ^o), 


(3) 


_  Pb  -  pa 


—  const., 


p{0o)  =  Pa,  p{0l)=Pb, 


where  p.  is  the  radius  of  circular  arc  that  is  the  evolute  of  r,  and  A  is  the 
forward  difference  operator  defined  by  Azi  =  Zi^i-Zi.  We  change  the  variable 
of  the  expression  (1)  from  s  to  0  by  the  relation  (3)  and  integrate  it: 


p9\ 

Apo  =  /  ptdO  =  patio  +  p{ti  -  to)  -  Pbtii,  (4) 

Joo 


r{0o)=Po^  t{0o)=to,  n{do)  =  no, 

r((9i)=Pi,  t{ei)=ti,  n(6>i)  =  ni. 

The  vector  equation  (4)  is  understood  easily  as  in  Fig.  1. 
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The  expression  (4)  is  a  system  of  equations  with  respect  to  unknowns  pa  and 
pb-  We  can  solve  this  as  follows: 

_  A^o  cos(^i  -  (/>o)  +  sin(^Q  -  (/>o)  -  sin(^i  -  <j)o)  ^ 

-2  +  2cosA^o  + A^osinA(9o 
_  AOq  cos{6o  -  (t>o)  +  sin(^Q  -  0o)  -  sin(^i  -  0o)  ^ 

-2  +  2  cos  A^o  +  A^o  sin  A^o  ° 

Since  the  involute  arc  is  given  in  terms  of  start  and  end  points  as  well  as  cor¬ 
responding  tangent  vectors  by  the  expression  (5),  we  can  obtain  a  involute 
curve  by  the  Hermite  interpolation  which  satisfies  the  equations 

r{0i)=Pi,  i  =  0,l,...,n.  (6) 


Radii  of  segments  of  the  evolute  of  r  are  rewritten  as 


cos(^i  -(f>i)-  cos{9i+i  -  00 
—2  +  2  cos  A9i  +  A9i  sin  A9i  *  ’ 


i  =  0,  1, . . .  ,n  -  1. 


§3.  Interpolating  Involute  Curve 

We  can  also  derive  an  interpolating  involute  curve  of  circular  arcs.  Using 
the  equation  (5),  we  can  solve  the  following  nonlinear  equation  system  (7) 
with  respect  to  unknowns  ^o,  ^i, .  •  •  by  the  Newton-Raphson  method: 

Pi  =  p{-9i)  =  p{+9i),  z  =  1,  2,  . . . ,  n  -  1.  (7) 

However,  the  Jacobian  matrix  necssary  for  the  method  makes  a  programming 
code  long  and  convergence  relatively  slow.  Therefore,  from  the  practical  point 
of  view,  we  prefer  to  solve  the  following  equations  (8)  based  on  the  equation  (4) 
directly.  Adding  unknowns  po,  pi, . . . ,  p„  to  the  previous  ones  9i,. . .  ,9n, 
we  get 

Pi'^i  +  PiAti  pj^-iTlj+i  =  Li  ^  0  •  ^  1,  .  .  .  ,  71  1,  (8) 

^Pi  •  n  1  1 

—  A^-  ’  ^  1, . . .  ,7r  1. 

Fig.  2  shows  an  example  of  interpolating  G"^  involute  curves  and  its  profile 
of  curvature  and  radius  of  curvature.  Initial  values  were  from  the  conventional 
C^  cubic  splines.  The  Newton-Raphson  method  converged  after  three  itera¬ 
tions.  In  spite  of  the  unpleasant  configuration  of  data  points,  the  curve  derived 
is  quite  smooth.  Its  evolute  curve  (circular  arcs)  is  G^  continuous  except  for 
two  cusp  points  that  correspond  to  extremal  points  of  the  radius  of  curvature. 
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Based  on  these  conditions,  we  are  going  to  derive  a  set  of  equations  with 
unknown  parameters  of  p{9)  and  solve.  We  use  the  following  notation  in  p— 6 
space: 

1)  Knots:  $-1,  ^0)  •  •  • »  where  the  end  knots  are  of  multiplicity  2. 

0—1  =  Oq,  ^n+l  ~  On. 


2)  de  Boor  ordinates  [2]:  do,  di,...,  d„+i,  where  di  corresponds  to  the 
Greville  abscissa  (0i_i  +  0i)/2. 

3)  Bezier  ordinates  [2]:  6o,  b2n- 

For  easy  manipulation,  we  break  down  the  non-uniform  B-spline  function 
p{0)  into  the  following  quadratic  Bezier  functions  with  local  parameter  t: 

p(e)  =  (1  -  tfb2i  +  2(1  -  t)tb2i+l  +  t%i+2,  (11) 


0<i  =  ^^<l,  i  =  0,l,...,n-l. 

di+iAOi-i diAOi  n  1 


b2i+l  —  di+i, 
Prom  (11),  we  obtain 


2  =  0,  1, ...  ,n  -  1. 


pi  —  fJ'i  —  ^0  ,  2  —  0,  1,  .  .  .  ,  71, 


_  2AH2i 
'''  “  (AdiY  ’ 


2  =  0,  1, . . .  ,n  -  1. 


We  solve  the  equations  (10)  with  unknowns  0o,0i,...,  ^n,  do,  di, . . . ,  dn+i  by 
the  Newton-Raphson  method,  using  initial  values  from  the  conventional 
cubic  splines.  The  number  of  equations  is  27i,  while  the  number  of  unknowns 
is  2n  +  3.  Accordingly  we  can  give  3  more  additional  requirements.  Since  the 
radius  of  curvature  p{0)  determines  a  unique  curve  shape,  we  can  specify  and 
control  an  interpolating  curve  by  the  control  polygon  (Greville  abscissae  and 
de  Boor  ordinates)  of  p{0).  Therefore  the  curve  includes  circular  arc,  circle 
involute  and  involute  of  the  circle  involute  because  p{0)  is  a  quadratic  B-spline 
function. 

Fig.  3  shows  an  example  of  an  interpolating  involute  curve  with  the 
same  data  points  and  the  same  end  tangents  as  in  Fig.  2.  The  computation 
converged  after  four  iterations.  The  evolute  of  this  curve  has  cusp  points 
within  segments,  since  the  radius  of  curvature  is  quadratic,  while  the  evolute 
in  Fig.  2  has  cusp  points  only  at  junction  points.  The  evolute  of  the  evo¬ 
lute  (circular  arcs)  has  three  cusp  points.  The  curvature  profile  shows  the 
smoothness  of  this  involute  curve. 
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§5.  Some  More  Numerical  Results 

To  illustrate  the  properties  of  the  newly  developed  curve,  we  show  some  more 
examples  of  the  curves.  The  same  data  points  are  used  in  Figs.  1  to  5. 

The  involute  curve  is  practically  more  important  than  the  and 
G^  ones.  Accordingly,  in  Fig.  4  the  G^  involute  in  Fig.  2  is  compared  with 
other  curves:  (a)  the  G^  involute  curve  with  the  same  end  tangents  in  Fig.  3, 
(b)  the  conventional  cubic  spline  curve  which  is  used  as  an  initial  curve 
by  the  Newton-Raphson  method,  and  (c)  a  G^  biarc  curve  derived  by  mini¬ 
mum  difference  between  curvatures  of  two  arcs  [4].  The  labels 
“(7^”  in  Fig.  4  point  out  which  side  the  corresponding  curve  passes  through. 
Small  circles  in  Fig.  4(c)  are  centers  of  circular  arcs.  It  is  understood  from 
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§6.  Concluding  Remarks 

We  proposed  the  design  method  of  up  to  interpolating  curve  as  an  involute 
of  circular  arcs  or  an  involute  of  the  circle  involute  arcs.  This  straightforward 
approach  provides  a  tool  for  the  construction  of  planar  curves  consisting  of 
segments  with  monotone  curvature  radius  plots  of  constant  sign.  Available 
NC  machines  with  an  involute  generator  are  able  to  draw  the  objective  curves 
with  reduced  data. 
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Interpolation  from  Lagrange  to  Holberg 


Michel  Leger 


Abstract.  As  the  order  2n  tends  to  infinity,  Lagrange  iiiterpolators  of 
periodically  sampled  ID  functions  converge  to  the  sine  function  modulated 
by  two  exponentials.  One  is  related  to  instabilities  and  the  other  to  Gaus¬ 
sian  apodizing.  The  Hermite  interpolation  of  Lagrange  interpolators  gives 
convolutive  -differentiable  Lagrange-Hermite  interpolators.  Whereas 
their  support  has  width  of  order  2n  +  2,  the  active  part  of  their  impulse 
response  is  width  of  order  V2n,  instead  of  2n  for  Holberg  interpolators, 
which  are  optimal  combinations  of  Lagrange-Hermite  interpolators,  and 
therefore  much  more  efficient.  Efficient  filters  can  be  derived  frorn  these 
differentiable  interpolators,  as  well  as  numerical  schemes  of  derivatives  at 
any  abscissa. 


§1.  Introduction 

Some  applications  require  very  large  2D  or  3D  regular  grids,  such  as  finite- 
difference  modeling  of  seismic  waves,  for  instance.  The  processing  of  these 
grids  involves  the  computing  of  numerical  schemes  of  first  or  second  deriva¬ 
tives,  and  also  interpolators  and  filters.  These  quantities  need  to  be  evaluated 
very  efficiently  because  the  requirements  in  terms  of  computation  time  and 
memory  use  are  critical. 

Numerical  schemes,  interpolators  and  filters  are  interrelated  issues,  and 
I  choose  to  study  them  from  the  viewpoint  of  interpolation,  which  is  the 
most  general.  For  sake  of  simplicity,  I  assume  ID  interpolation  of  periodically 
sampled  functions,  with  unit-sampling  rate  and  even  orders. 

§2.  Lagrange  Interpolators 

Definition  1.  For  some  function  f  with  known  values  fi  at  abscissae  Xi  =  i, 
i  e  [-n,  n],  the  Lagrange  interpolator  ([9,6,2,3])  of  order  2n  is 

'^2n({/fc}fe=-n)^)  =  n  i  -  j  ~  fiP2n,i{^)-  (1) 

i=— n  j  =  —n  •'  i=—n 

3^i 
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It  can  be  shifted  and  centered  at  any  integral  abscissa  i: 

j=-n 


Proposition  2.  As  2n  oo,  Lagrange  polynomials  converge  to  the  sine 
function  modulated  by  two  exponentials: 


P2n,iW  = 


sin7r(a:  —  i) 
7r{x  —  i) 


-z" 


exp{--^)  exp(-^)  (l  +  0(n  ^)). 


n+  f 


n+ 


Proof:  Simple  changes  in  (1)  give 


(2) 


P2n,i(^)  = 


(n  +  z)!  {n  -  z)!  {x  -  i)  p 


Since  nr=l(l  -  f)  =  have 


(3) 


nc-S)-=^'  nc-S>- 


j=l 


j-n+1 


Since  In n”„+i(l-fJ)  =  -x^T,7+i(ji+0(j  “)),  and  since  ^ 
F 1  "'e  obtain 


(4) 

o(i-^)  + 


OO  2 

n  (1  -  !■) = 

J=n+1 


71+^ 


+  0{n-^)). 


(5) 


Moreover,  by  using  Stirling  formula  n!  =  n”  exp(— n)  y/27m  (1  +  + 

0(n“^)),  and  noting  that  ln(l  +  j|^  + 23!;^+ C>(?2"^))  =  “+0(n“^)and  that 
'  (2  -  -  ;^)  =  0(n-3),  we  obtain  In  =  -(n  +  i  +  1)  ln(l  + 


1 

12n 


^)  -  (n  -  z  +  |)  ln(l  ~  :^)  +  0{n  ^).  Since  ln(l  +  x)  —  x-^  +  ^  +  0{x^ 
we  have  In  =  ii+T  +  0{n-^)  and  hence 


(»!)^ 

{n  +  z)!  (zz  —  z)! 


+  0(n-^)). 


(6) 


Noting  that  (~l)*sin7rrr  =  sin7r(a:  —  z)  and  inserting  (4),  (5)  and  (6)  in  (3) 
concludes  the  proof.  □ 

According  to  the  first  term  of  (2),  Lagrange  polynomials  converge  to  the 
perfect  interpolator  sine  function  (Fig.  1).  Away  from  the  center  of  the  interval 
of  the  data  points,  the  first  exponential  explains  the  well-known  instabilities 
(a  small  change  in  the  data  results  in  a  large  change  in  the  interpolation),  and 
the  second  one  corresponds  to  Gaussian  apodizing  (see  apodization  in  [10]), 
that  is,  the  vanishing  effect  of  the  non-centered  data  points.  Note  that  these 
two  exponentials  compensate  one  another  as  x  — >  z. 
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Fig.  1.  Lagrange  polynomials  of  order  14  divided  by  the  two  exponentials  of  (2), 
for  i  =  0,2, 4, 6.  Related  sine  functions  in  dotted  lines. 


§3.  Stationary  Lagrange  Interpolators 

Since  a  Lagrange  interpolator  is  stable  only  near  the  middle  of  the  interval  of 
the  contributing  data  points,  a  natural  idea  is  to  change  it  for  each  interval 
between  two  successive  points,  in  such  a  way  that  the  interpolator  is  always 
used  at  its  best. 

Definition  3.  The  stationary  Lagrange  interpolator  of  order  2n  uses  at  ab¬ 
scissa  X  the  Lagrange  interpolator  of  order  2n  centered  at  ix,  with  x  =  ix  +  dx, 
ix  G  IN,  4  G  [0,  1[: 


n 

L2n{{fk}ke'Z,^)  =  fix+j  P2n,j{^x)  =  ■^2n(^)'  C^) 

j=—n 


Remark  4.  Any  interpolator  such  that  I({fk}k£Z,  =  Z)j=a  /*x+j  Pj(^x)> 
with  a,  b  and  pj  independent  of  ix,  is  convolutive,  that  is,  there  exists  a 
continuous  function  X(x)  such  that  a?)  =  A{f)*X(x),  with  A(f)  = 

fi  0  feeing  the  ‘‘Dirac  comb”  modulated  by  function  f . 

This  is  obvious  by  considering  the  impulse  response  A(x)  =  T{{6kQ}ke:7L,^)  = 
P-ixi^x),  with  ^*0  the  Kronecker’s  symbol. 

As  a  particular  consequence,  stationary  Lagrange  interpolators  are  con¬ 
volutive  with  impulse  responses  A2n(ic)  =  P2n, Moreover,  they  are 
also  subject  to  Gaussian  apodizing  since,  as  2n  — >  oo, 


A2n(a;)  ~ 


sin  xx 

XX 


exp(^). 


(8) 


It  is  clear  from  Fig.  2  that  these  impulse  responses  vanish  very  rapidly  as 
compared  to  the  sine  function. 

Remark  5.  Since  "ik,  k  G  [1,  2n],  x’‘^ X2n(^)dx  =  0,  then,  with  T  denot¬ 

ing  the  Fourier  transform,  VA:,  k  G  [1,  2n],  (^(A2n)('ii))i=o  “  ^  ([^D- 


Hence,  stationary  Lagrange  interpolators  are  good  at  low  frequencies. 
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Fig.  2.  Impulse  responses  of  stationary  Lagrange  interpolators  for  orders  2  to 
14  (the  last  one  is  truncated),  sine  function  in  dotted  lines. 


§4.  Hermite  Interpolators 

In  a  general  way,  Hermite  interpolators  are  consistent  with  the  values  of  a 
function  and  its  A:  (or  fc,-,  1  <  z  <  1)  first  derivatives  at  I  distinct  points  [4,2,3]. 
Here,  I  consider  convolutive  interpolators  for  any  k  and  1  =  2. 

Definition  6.  The  piecewise  polynomial  Hermite  interpolator  is 
Tikif)  =  ^(/)  *  +  •  •  •  +  A(/(^^)  +  zyjfcj  +  . . .  +  A(/^*^^)  *  T]kk, 

where  rfkj(x)  are  [—1,  +l]-supported  basis  functions  such  that  Vm,  m  G  [0,  k], 

From  now  on,  I  consider  7]ko{x)  functions  only. 

Proposition  7.  The  polynomial  expression  of  Hermite  interpolators  7]ko 

l-Vkoix)  =  {2k  +  l)Cl,J2^  j  +  l”  (9) 

j—k 

forx  G  [0,  1],  and  7]ko{x)  =  7]ko{-x)  forx  G  [-1,  0].  Moreover,  asymptotically, 

lim77fco(^)  ~  nj(a:)  *  2^^exp(-4fca:^),  when  fc  — )•  oo,  (10) 

with  n^(x)  =  1  if  a;  G  [o  -  a  +  |]  and  nj(a:)  =  0  otherwise. 

Proof:  Fora;  G  [0,  1],  it  is  clear  from  Def.  6  that  =  Q!a;*=(l-a;)*'. 

Since  77*0(0)  =  1  and  77*0(1)  =  0,  we  have 

1=  f  (1  -  77*o(a:))(^^dx. 

«  Jo 


(11) 
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Fig.  3.  Right  part  of  impulse  responses  of  Hermite  interpolators  according  to 
(9),  their  asymptotic  form  according  (10)  in  dotted  lines. 

Let  us  define  Ik,i  =  —  xYdx.  Integrating  by  parts  gives  = 

that  is,  Ik,k  =  ^-^2fc,o  =  C'|fc/2fc,o  by  recurrence.  Since 
/2fe,o  =  2j^’  obtain  a  =  {2k-\- and  therefore,  from  (11), 

(1  -  =  (2fc  +  1)  C^k  ^‘(1  -  (12) 

The  binomial  theorem  and  a  simple  integration  concludes  the  proof  of  (9). 
Moreover,  by  the  change  of  variable  x=  |  +  u  in  (12),  we  have  ( 2  +  '*^)  “ 
(2fc  4- 1)  Clfc  4“*^  (1  —  For  any  abscissa  Wi  >  0  we  may  define  Uk  =  ^ 

and  we  have  exp(— 4fcii^)  =  exp(-4'iii)  and  (1  —  4u^)^  =  (1  —  4^)*.  Since 
limit_oo(l  —  4^)*  =  exp(— 4^1),  then  (1  —  4u^)*  ~  exp(— 4/cu^)  as  — >  00. 
Moreover,  by  using  C^k  —  and  Stirling  formula,  we  obtain,  as  A;  — > 

00,  (2k  +  l)(7|fe4”*=  2^,  and  then  -T^j^o  (a;)  ~  2^  exp(-4A;(a;  ~  |)2). 

Symmetrically,  we  have  7y^o^(a;)  ~  2yJ^  exjp{—4k(x  +  |)^)  for  x  G  [—1,0]. 

Therefore,  p^^ix)  ~  2-y/^  exp(— 4itic^)+  (5((C  +  ^)  — (5(a:  — |)),  which  concludes 

the  proof  of  (10)  since  nj^^^(x)  =  6{x  -\-  \)  —  S(x  —  □ 

Note  that  Lagrange  and  Hermite  interpolators  could  be  considered  as 
Fourier  pairs  since  the  right  members  of  (8)  and  (10)  are  mutual  Fourier 
transforms  if  Tr^n  =  4A:. 

§5.  Lagrange-Hermite  Interpolators 

Since  stationary  Lagrange  interpolators  are  good  at  low  frequencies  and  since 
Hermite  interpolators  are  differentiable,  hence  good  at  high  frequencies,  com¬ 
bining  them  is  a  natural  idea. 

Definition  8.  For  k  >  0,  the  Lagrange-Hermite  interpolator  of  order  2n  is  the 
smooth  Hermite  interpolation  of  two  successive  Lagrange  interpolators,  that 
is, 


•^2n  —  Vko(^  *a:) '^2n(^)  "L  (1  Vkoi^  '^x))  (^)‘ 
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Fig.  4.  Right  part  of  the  impulse  responses  of  C^-differentiable  Lagrange-Hermite 
interpolgitors  (a),  sine  in  dotted  lines.  Their  left  part  divided  by  the  sine 

_  2 

(b),  as  compared  to  Gaussian  curves  exp(--;|;j^),  in  dotted  lines. 

Proposition  9.  For  k  >0,  Lagrange-Hermite  interpolators  M.2n^ 
differentiable. 

Proof:  Since  it  is  piecewise  polynomial,  A^2n  ^  C°°-differentiable  for  all 
non-integral  abscissas  x.  Moreover,  is  convolutive,  and  thus  it  is 

sufficient  to  examine  it  around  x  =  0.  From  Def.  8,  for  rc  >  0,  we  have 
=  Vkol^2n+{'^~Vko)Cl^-  From  (12),  we  have and 
then  =  C2n{x)  +  0(x^'^'^)  -  £§„).  Since  the  interpolators 

and  C2n  are  continuous,  £2n  “  ^2n  =  and  thus  =  ^2n  +  0{x^^'^). 

Therefore,  for  any  j  G  [1,  A:  1],  ^'^^0’*’)  =  The  same  argu¬ 
ment  with  M2n^  =  7]k0  C2n  +  (1  -  Vko)  ^^2n  for  ic  <  0  leads  to  M2n^  = 

£2n^\0“),  which  concludes  the  proof  since  C^^ix)  is  a  polynomial.  □ 

The  Fourier  transform  of  Lagrange-Hermite  impulse  responses  (which  I 
call  P2n  interpolator  At  In)  are  very  similar  to  those  of  stationary  Lagrange 
interpolators  below  the  sampling  frequency.  Beyond  the  sampling  frequency, 
for  the  orders  2  to  14,  the  rejection  in  dB  of  the  greatest  secondary  lobe  is 
27,  30,  32,  33,  34,  35  and  35  in  the  case,  42,  47,  51,  53,  55,  57  and  58  in 
the  case,  33,  36,  37,  38,  39,  40  and  41  in  the  case,  and  slowly  decreases 
for  higher  differentiabilities.  From  this  viewpoint,  the  Lagrange-Hermite 
interpolators  are  the  best  choice. 

§6.  Holberg  Interpolators 

Lagrange-Hermite  interpolators  are  good  at  low  and  high  frequencies,  but 
unsatisfactory  inbetween.  Indeed,  Gaussian  apodizing  makes  \2n  as  well  as 
(i2n  gradually  ineffective,  since  their  cost,  which  is  proportional  to  the  length 
of  their  support,  increases  like  n,  whereas  their  active  part  widens  like  >/n. 
Faced  with  the  same  problem  in  terms  of  numerical  schemes,  Holberg  ([5])  had 
the  idea  of  combining  several  orders  and  optimizing  the  passband  for  given 
tolerance  and  maximal  order.  Holberg  interpolators  just  proceed  from  the 
same  idea. 
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Fig.  5.  Impulse  responses  of  C^-differentiable  and  0.1%-precise  Holberg  interpo¬ 
lators  (a)  (Lagrange-Hermite  in  grey,  sine  in  dotted  lines).  Their  Fourier 
transform  (b)  and  an  enlargement  of  them  (c).  The  passband-us-order 
diagram  (d)  showing  the  linear  behaviour  or  Holberg  interpolators  and 
the  parabolic  behaviour  of  the  Lagrange-Hermite  interpolators. 


Definition  10.  The  2n-bybrid  order,  -differentiable  and  e-precise  Hol¬ 
berg  interpolator  is  the  linear  combination  ~  Yll=i  such  that 

/?f  =  1  and  such  that  the  following  passband  is  maximized, 


Clearly,  these  Holberg  interpolators  are  convolutive,  with  impulse  responses 

fit  =  ET/JfML- 

Fig.  5  illustrates  the  case  of  e  =  0.1%  and  k  =  1  (C^-differentiability). 
Fig.  5a  shows  that  the  impulse  responses  of  Holberg  interpolators  decrease 
much  slower  than  the  Lagrange-Hermite  interpolators.  The  sum  of  the  j3f  is 
one,  but  their  absolute  sum  may  be  far  from  one,  for  instance  about  4300  in 
the  case  of  the  14th  order.  In  the  Fourier  domain.  Fig.  5b,  together  with  its 
enlargement  Fig.  5c,  shows  that  the  passband  is  increased  a  lot  from  Lagrange- 
Hermite  to  Holberg. 

For  tolerances  1%  and  0.1%,  Fig.  5d  shows  the  parabolic  behaviour  around 
infinite  order  of  the  passbands  of  the  Lagrange-Hermite  interpolators  (grey 
arrows  are  parabolas  with  vertical  tangent  at  the  corner).  This  is  due  to 
Gaussian  apodizing,  since  the  Fourier  transform  of  (8)  gives  the  convolution 
of  a  box  function  with  a  Gaussian  function  that  narrows  like  as  n  — oo. 
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Fig.  6.  Right  part  of  the  impulse  responses  of  the  lOth-order  Lagrange-Hermite 
(in  grey)  and  Holberg  interpolators  of  first  (a)  and  second  (b)  deriva¬ 
tives.  Perfect  (but  truncated)  interpolators  for  derivatives  are  displayed 
in  dotted  lines. 

On  the  contrary,  the  passbands  of  the  Holberg  interpolators  have  a  linear 
behaviour  (black  arrows  are  straight),  which  is  the  best  possible  one  because 
the  active  part  of  the  impulse  response  of  the  interpolator  cannot  expand  faster 
than  its  support.  The  passbands  are  referred  to  the  sampling  frequency. 

The  optimization  of  passband  B  in  the  /3f  space  is  difficult  because  B 
is  discontinuous  along  n  —  1  hypersurfaces  (related  to  tangencies  at  lie) 
which  intersect  at  the  optimum.  To  overcome  this  difficulty,  I  used  a  method 
consisting  of  the  following  steps: 

1)  choose  a  strictly  increasing  sequence  of  frequencies  i/j  in  ]0,  0.5[,  with 
1  <z  <n-l, 

2)  set  at  1  -t-e  the  value  of  the  combination  at  frequency  i/n-i?  1  —  £  at  i^n-2, 

1  -f-  e  at  and  so  on  until  i/i,  and  finally  1  at  frequency  uq  =  0, 

3)  solve  the  linear  system  for  the  , 

4)  detect  the  frequencies  at  which  the  combination  is  extremal, 

5)  if  these  frequencies  are  close  to  the  Ui,  then  stop,  else  update  the  i/j  and 
come  back  to  step  2. 

A  priori,  the  feasability  of  steps  3  and  4  and  the  convergence  are  not 
guaranteed.  In  practice  however,  this  method  works  simply  well,  with  less 
than  ten  iterations.  See  also  [7,8]  on  optimal  filtering. 

§7.  Applications 

The  main  application  of  Holberg  interpolators  are  Holberg  numerical  schemes 
([5])  because  they  are  cost-effective  in  the  field  of  numerical  simulation  of 
acoustic  wave  propagation.  Especially  in  3D,  this  effectiveness  is  of  consider¬ 
able  importance  because  of  the  huge  amount  of  computing  time  needed. 

From  a  2n-order  C^-differentiable  Lagrange-Hermite  or  Holberg  interpola¬ 
tor,  2n-order  “"‘-differentiable  interpolators  of  the  mth  derivative  (m  <  k) 
can  be  easily  derived,  as  well  as  numerical  schemes  of  these  derivatives  at  any 
abscissa.  Fig.  6  shows  the  responses  of  the  lOth-order  Lagrange-Hermite  and 
Holberg  {e  =  0.1%)  interpolators  of  first  (a)  and  second  (b)  derivatives.  The 
values  at  integral  abscissas  give  standard  numerical  schemes.  In  a  similar  way, 
the  integration  of  these  impulse  responses  could  result  in  Newton-Cotes-like 
Holberg  formulas  (see  Newton-Cotes  Formulas  in  [10]). 
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Fig.  7.  A  seismic  section  (a)  has  been  horizontally  filtered  (b)  for  antialiasing, 
threefold  undersampled  (c),  and  finally  interpolated  (d). 

Holberg  interpolators,  and  numerical  filters  that  can  be  generated  from 
them,  are  also  interesting  for  their  efficiency.  For  instance,  the  response  of 
a  1%-precise,  6th-order,  Holberg  interpolator  has  been  fourfold  oversam¬ 
pled  and  used  to  filter  horizontally  Fig.  7a  into  Fig.  7b  for  antialiasing.  The 
threefold  undersampling  of  Fig.  7b  gives  Fig.  7c.  The  1%-precise,  6th-order, 
threefold  Holberg  interpolation  of  Fig.  7c  gives  Fig.  7d.  The  similarity  of 
(b)  and  (d)  measures  the  quality  of  the  filtering  and  of  the  interpolation. 

§8.  Conclusions 

In  the  case  of  periodic  data  points,  Lagrange  interpolators  converge  to  a  sine 
function  multiplied  by  two  exponentials.  The  first  one  explains  the  well- 
known  instabilities  of  Lagrange  polynomials,  which  only  vanish  at  the  center 
of  the  interval  of  the  data  points.  The  second  exponential  explains  the  van¬ 
ishing  influence  of  non-centered  data  points  (Gaussian  apodizing).  Station¬ 
ary  Lagrange  interpolators  are  stable  and  convolutive.  Hermite  interpolators 
are  as  differentiable  as  desired  and  convolutive  and  their  impulse  response 
converges  to  the  convolution  of  a  box  function  with  a  Gaussian  function. 
Lagrange-Hermite  interpolators  combine  the  advantages  of  unlimited  order 
and  differentiability.  Because  of  Gaussian  apodizing,  these  interpolators  be¬ 
come  ineffective  at  high  orders.  On  the  other  hand,  Holberg  interpolators 
have  a  much  better  quality /cost  ratio  since  they  are  optimal  combinations  of 
Lagrange-Hermite  interpolators.  From  Holberg  interpolators,  efficient  numer- 
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ical  schemes  of  derivatives  can  be  evaluated  at  any  abscissa,  and  oversampling 
their  impulse  response  gives  short  but  efficient  filters. 
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Local  Approximation  on  Manifolds  Using 
Radial  Functions  and  Polynomials 


Jeremy  Levesley  and  David  L.  Ragozin 


Abstract.  The  main  focus  of  this  paper  is  to  give  error  estimates  for 
interpolation  on  compact  homogeneous  manifolds,  the  sphere  being  an  ex¬ 
ample  of  such  a  manifold.  The  notion  of  a  radial  function  on  the  sphere  is 
generalised  to  that  of  a  spherical  kernel  on  a  compact  homogeneous  man¬ 
ifold.  Reproducing  kernel  Hilbert  space  techniques  are  used  to  generate 
a  pointwise  error  estimate  for  spherical  kernel  interpolation  using  a  posi¬ 
tive  definite  kernel.  By  exploiting  the  nice  scaling  properties  of  Lagrange 
polynomials  in  the  tangent  space,  the  error  estimate  is  bounded  above 
a  power  of  the  point  separation,  recovering,  in  particular,  the  convergence 
rates  for  radial  approximation  on  spheres. 


§1.  Introduction 

There  is  currently  significant  interest  in  approximation  on  spheres,  related 
to  many  interesting  geophysical  problems.  There  are  a  number  of  different 
approximation  methods  currently  available  on  spheres,  including  wavelets  [3], 
splines  [1],  and  the  subject  of  this  paper,  radial  functions  (sometimes  called 
spherical  splines)  [3,6].  Error  estimates  and  convergence  rates  for  radial  ap¬ 
proximation  on  spheres,  of  an  optimal  nature,  are  recent  in  vintage  [5,4],  and 
rely  on  some  technically  demanding  mathematics.  In  this  paper  we  build  on 
an  idea  of  Bos  and  de  March!  [2]  in  order  to  provide  convergence  rates  for  ra¬ 
dial  interpolation  on  a  much  wider  class  of  manifolds:  the  reflexive,  compact 
homogeneous  spaces.  We  will  conclude  the  paper  by  proving  a  local  spherical 
harmonic  polynomial  approximation  result  on  spheres. 

Let  be  a  d-dimensional  compact  manifold  with  a  metric  d(-,  •)  which 
possesses  a  transitive  group  G  of  isometries.  The  group  acts  transitively  in 
that  for  every  x,y  e  M^,  there  exists  g  e  G  such  that  gx  =  y.  If,  further¬ 
more,  there  exists  g  E  G  such  that  gx  =  y  and  gy  =  x,  then  is  termed 
reflexive  (for  more  details  see  [11]).  Such  a  manifold  is  a  reflexive,  compact, 
metric,  homogeneous  space.  We  comment  that  we  can  always  embed  in 
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some  higher  dimensional  Euclidean  space  the  group  G  being  a  compact 

subgroup  of  the  isometries  of  We  aissume  that  the  metric  d{‘,  •)  on 

is  inherited  from  some  Euclidean  embedding. 

We  will  be  interested  in  interpolation  on  using  continuous  zonal  ker¬ 
nels  k{‘,  •)  which  have  the  property  that  k{gx,gy)  =  k{x,y)  for  all  x,y  e 
and  g  e  G,  Such  kernels  are  natural  generalisations  of  radial  functions, 
which  are  functions  only  of  distance,  which  is  itself  G-invariant.  Given  a 
set  {a^i, . . . ,  Xiv}  C  and  data  /i, . . . ,  /tv  ^  we  seek  a  function  of  the 
form 

TV 

Sk{x)  =  "^aik{x,Xi) 
i=l 

such  that  Sk{xi)  =  fi,  i  —  1, . . . ,  N. 

Given  the  data  fi  =  f{xi),  i  =  we  wish  to  bound  the  pointwise 

error  between  sjt  and  f  at  x  e  M^.  We  make  no  assumption  on  the  data 
points  except  that  they  satisfy  a  point  separation  criteria  in  some  subset  of 

(see  Section  3). 

In  Section  2  we  will  introduce  some  necessary  harmonic  analysis  on 
discuss  the  notion  of  positive  definiteness  on  in  brief,  and  give  a  standard 
error  estimate,  which  we  will  use  in  Section  3  to  obtain  convergence  rates.  In 
Section  4  we  prove  a  Whitney  type  error  estimate  for  local  spherical  harmonic 
approximation  on  the  sphere. 

§2.  Harmonic  Analysis  and  Error  Estimates 

For  a  more  complete  version  of  the  brief  description  we  give  here,  see  [10,11]. 
Let  be  the  degree  n  polynomials  in  IR'^"^’’,  the  space  in  which  is 

homogeneously  embedded.  Then,  let  Vn  (the  spherical  polynomials  of  de¬ 
gree  n)  be  the  restriction  of  these  polynomials  to  M^.  Furthermore,  let 
'Hn  :=  Vn  where  the  orthogonality  is  with  respect  to  dfi,  the  unique 

normalised  G-invariant  measure  on  M^: 

[/,P]:=  / 

Jm<^ 

Then,  we  can  uniquely  decompose  Tin  into  irreducible  G-invariant  sub¬ 
spaces  each  of  dimension  d^j,  j  =  1, . . . ,  h„,  resulting  in  the  G-invariant 
decomposition 

^•2(M'')  =  ©”o<=1  =nj- 

Let  Xnj  be  the  orthogonal  projection  onto  n  =  0, 1, ... ,  and  j  = 
l,...,hn,  and  7^ni(-,-)  be  the  kernel  of  this  projection.  We  will  consider 
interpolation  using  strictly  positive  definite  kernels  of  the  form 

oo  d„ 
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where  a„j  >  0,  n  =  0, 1, . . . ,  j  =  1, . . . ,  hn,  and 

OO  dn 

EE  dnja-fij  <  OO.  (1) 

n=0 j=l 

We  will  approximate  functions  from  the  Hilbert  space 

OO  dn 

W  =  {/  G  L2{M'‘)  :  ll/f  :=  53  E 

n=0 j=l 

where  11-112  denotes  the  L2{M*^)  norm.  The  associated  inner  product  in  W  is 

(/jp)  ♦—  ^^^^[^njf}^nj9]/^nj‘ 
n=0  jf=l 

The  condition  (1)  ensures  that  point  evaluation  is  a  continuous  linear 
functional  in  W.  It  is  straightforward  to  show  that  k  is  the  reproducing  kernel 
for  the  W:  f{x)  —  (/,  k{x,  •)),  /  €  W,  x  6  An  immediate  consequence  of 
the  reproducing  kernel  property  is  that  Sk  is  the  interpolant  of  minimum  W 
norm.  For,  if  g  is  another  interpolant, 

N  N 

{sk-9,sk)  =  Y^ai(f  -  Sk,k{xi,^))  =  -Sfcfe))  =  0- 

i=l  i=l 

Therefore, 

(9,9)  =  {9~Sk-\-  Sk,9  -  Sfc  +  Sfc)  =  {9-  Sk,9-  Sk)-\-2{g-  Sk,Sk)  +  (sjfc,Sfc) 

=  {9-Sk,9-  Sk)  +  (sfc,sfc),  (2) 

and  the  norm  minimisation  property  is  established.  Now,  following  the  stan¬ 
dard  arguments,  see  e.g.  [8,9],  we  have,  using  the  fact  that  Sk  interpolates  / 
at  xi, . . . ,  Xjv, 

\f{x)  -  Sk{x)\  =  \{f  -  Sk,k{x,’)\ 

N 

=  \{f  -  Sk,k{x,  •)  -h  '^Pik{xi,  ■)! 

i=l 

N 

i=l 

<l!/lll|fc(x.-)  +  EAMxi,-)ll 

i=l 


294  J-  Levesley  and  D.  L.  Ragozin 

for  arbitrary  (di,  i  =  1, . . . ,  iV,  where  we  have  used  (2)  in  the  final  step.  Our 
final  error  estimate  follows  from  the  fact  that 

N  N  N 

2  =  1  2  =  1  i  =  l 

N  N 

=  {k{x,x)  -2'^Pik{x,Xi)+  ^  (3iPjk{xi,Xj))^. 
2=1  i,j=l 


Defining 


N  N 

P{x,Xi,. . .  ,xn)  inf  (k{x,x)  -  2'^Pik{x,Xi) -h  Y]  f3if3jk{xi,Xj))^ , 

. fel  ij=i 


we  sum  these  results  up  in 

Theorem  1.  Let  Sk  be  the  k-spUne  interpolant  at  xi, . . .  ,xjv  6  to  f  £  W. 
Then,  for  every  x  €  M^, 

\f{x)  -5fc(2:)|  <  ||/||P(a:,xi,...,a:;v). 


§3.  Convergence  Rates  for  Radial  Kernels 

In  this  section  we  shall  give  pointwise  error  estimates  in  terms  of  the  point 
separation 

/?  :=  max  min  d(y,Xi), 
yeV 

where  V  C  contains  x,  the  point  at  which  we  are  measuring  the  error.  As 
we  shall  see  later  in  this  section,  producing  a  pointwise  convergence  rate  from 
the  error  estimate  of  Theorem  1  requires  us  to  bound  Lagrange  polynomials 
related  to  a  subset  of  the  interpolation  points.  Efforts  to  produce  convergence 
rates  on  the  sphere  foundered  because  it  is  difficult  to  bound  the  Lagrange 
polynomials  for  spherical  harmonic  interpolation  as  the  interpolation  set,  with 
a  fixed  number  of  points,  scale  towards  x.  The  early  error  estimates  of  [3], 
of  0(p),  were  the  best  known  until  recently,  and  only  required  bounding  of 
the  constant  Lagrange  polynomial  for  a  single  point.  Light  and  v.  Golitschek 
[4]  proved  boundedness  for  all  polynomials  on  d  >  2,  and  consequently 
achieved  0{p'^)  approximation  for  radial  kernels  with  2r  continuous  derivatives 
on  the  sphere. 

A  very  simple  proof  of  the  result  of  Light  and  v.  Golitschek  was  given  by 
Bos  and  de  Marchi  in  [2].  What  we  will  do  is  introduce  an  analytic  coordinate 
transformation,  and  construct  Lagrange  polynomials  in  the  tangent  space, 
which  is  a  d-dimensional  Euclidean  space.  We  will  quote  a  result  which  uses 
scaling  arguments  in  Euclidean  space  which  are  easy  to  perform,  observing 
that  distance  on  the  manifold  and  in  the  tangent  are  comparable. 
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Fig.  1.  Coordinate  chart  at  x. 

Let  X  ^  V  C.  M.  We  shall  assume  the  existence  of  a  (7°°-chart 
with  an  open  subset  U  CUq  satisfying  the  following  (see  Figure  1): 

1)  ^(17)  =  V  with  '0(0)  =  a;, 

2)  1^0  =  {y  -  Z  :  y,z  e  [/}. 

These  conditions  ensure  the  validity  of  Taylor  series  arguments  which  follow. 
Also,  since  1/  is  precompact,  ip  is  bi-differentiable  and  the  metric  d  is  assumed 
boundedly  equivalent  to  the  Euclidean  distance  on 

Ci\\y-z\\  <d{'ip{y)A{z))  <C2\\y-z\l  y,zeu. 

Let  111, ...  ,vq  =  1/  n  X  be  the  interpolation  points  in  V,  and  redefine 
P  ■-  sup„gy  minf=i,...,Q  d{v,VQ).  Let  Ui  =  ip~^{vi),  z  =  1, . . . ,  Q.  Then,  from 
the  previous  equation  we  have 

r;:=sup  min  |fy-'ail|<p/ci.  (3) 

It  is  shown  in  [7]  that  provided  p  and  hence  rj  are  sufficiently  small  to 
guarantee  that  Q  >  t,  we  can  make  a  selection  of  interpolation  points  vi,...,vt 
(assuming  a  convenient  ordering  of  the  points),  where  t  =  dim(nf^_i),  such 
that  the  Lagrange  polynomials  for  wi, . . .  are  bounded  at  the 

origin: 

Pi{0)<CL,  z  =  l,...,t,  (4) 

where  Cl  is  independent  of  i  and  p.  Furthermore,  -ui, . . .  are  all  contained 
in  :=  {qw  :  ||u;||  <  6}.  We  are  now  ready  to  prove  the  main  theorem  of 
this  paper: 
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Theorem  2.  Let  (/»:  B,  IR  and  fc(-,0  =  0(cf(-,'))  strictly  positive 
definite,  and  2r-times  continuously  differentiable  in  each  variable.  Let  x  6 
S  C  M.  Suppose  that  the  interpolation  points  xi,. . .  ,Xn  satisfy 

sup  min  d{y,Xi)  ^  p. 

yesi=L.-,n 

Then  for  all  sufficiently  small  p,  if  Sk  is  the  k-spline  interpolant  to  f  eW, 
\f{x)-Sk{x)\<C\\f\\p^, 
where  C  is  independent  of  p. 

Proof:  First  we  now  choose  the  coefficients  (appearing  in  the 

statement  of  Theorem  1)  as  follows.  Let  /?i  =  Pj(0)  if  Xi  =  i’iuj)  for  some 
j  Set  Pi  =0  otherwise.  This  choice  of  coefficients  is  made  since 

=  g(0),  (5) 

j=i 

for  all  q  G  Tf^r-v  Then, 

t 

P{x,xi,...,xn)  <  U(d(V>(0),V’(0)))  -2'y^pj{0)(t>{d{4>{0),i>iuj))) 

,  (6) 

+  T.  Pi{o)pj{o)<t>{d{il>M,i>M))\  ■ 

i,}  =  l 

Since  (f){d{',  •))  is  2r  times  continuously  differentiable  in  each  variable,  for 
fixed  w  e  U  we  may  expand 

0(d(V^(w),V'(^)))  =  zeU,  (7) 

|al<2r 

where  R^r  is  a  Taylor  series  remainder  satisfying 

R^M^CRWz-wf,  (8) 

for  some  constant  Cr  independent  of  2  and  w.  Putting  z  =  w  in  the  above 
expansion  we  see  that 

es  =  0(0).  (9) 

Putting  (7)  into  (6)  gives 

P(x,a;i,...,a:jv)  <  [<^(0)  “  2^Pj(0)(  ^  (uj)"  +  P2r(i^i)) 

j=l  |Q|<2r 

»j=l  |Q;|<2r 
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Using  the  polynomial  reproduction  (5),  we  get 

t 

P{x,xi,...,xn)  <  [  -  (f){0)  -  2Y^pj{0)R^^{uj) 

j-i 
t 

+E?'-(o)(  E  (10) 

i=l  jQi|<2r 

^  1 1/2 
+  E  Pi(o)pj(o)'R2;(»j)J 

ij=l 

where  we  have  used  (6)  and  (9)  in  the  above  argument. 

Now,  since  the  distance  function  is  symmetric,  for  any  UyW  E  IR*^, 

|a|<2r 

=  E  <=o(®-“)“+'R2r(™)- 

ta!<2r 

In  particular,  with  ty  =  0  we  get 

E  <=«(-“)“=  E 

|a|<2r  |a|<2r 

Substituting  the  last  equation  into  (10)  and  again  using  (5)  gives 

t 

P{x,  Xi,  .  .  .  ,  Xj\i)  ^  ^  ~  ^(0)  ~  2  (0)i?2r(^j  ) 

i=i 

+Epi(o)(  E  c“(«i)“+<(»)-iij,(o)) 

i=l  |Q;|<2r 

+  E  Pi(0te(0)-R2M“j)]  ' 

i,i=l 

<  [-Epi(o)«(“i)+«2;(o)) 

j=i 

*  t1/2 

+  Pi(0)Pj(0)-^2r(l^;)j 

i,J=l 

t 

<  Cr{,  max  {||«j-«j||,||«i||})''  E  IPi(0)l|2  +  Pj(0)| 

t,J  =  l 

t 

<CRrf  Y,  Cl{2  +  Cl) 

i,j=l 

<Cp\ 
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using  (3),  (4)  and  (8),  recalling  that  are  contained  in  an  77  <  p/ci 

scaled  ball  of  radius  h.  Substituting  this  result  in  Theorem  1  concludes  the 
proof.  □ 


§4.  Whitney-type  Estimates 

We  will  now  use  the  coordinate  system  suggested  by  Bos  and  de  Marchi  [2] 
to  prove  Whitney  type  estimates  for  approximation  using  spherical  harmonics 
on  This  result  answers  a  question  posed  by  L.  L.  Schumaker  during  the 
conference  for  which  these  are  the  proceedings.  For  a:  =  (0:2, ,  a^d+i)  €  K  , 
define  ©  :  5''^  by  0(x2, . .  .,Xd+i)  =  ((1  - 

This  is  a  smooth  parametrisation  of  a  neighbourhood  of  ei  =  (1, 0, . . . ,  0)  €  S^. 
As  long  as  ^  d(ei, 0(cc))  <  p. 

Without  loss  of  generality,  we  shall  consider  the  approximation  of  a  func¬ 
tion  f  eC^{S^),  using  spherical  polynomials,  in  any  spherical  neighbourhood 
Xp  of  ei,  where  maXy^Xp  d(ei,y)  =  p  <  7r/2.  In  fact,  we  will  prove 

Theorem  3.  Let  f  e  Then,  there  exists  a  degree  k  harmonic  poly¬ 

nomial  pk-i  such  that,  for  every  7r/2  >  p  >  0, 

max \f(x)  -  Pk-i{x)\  <  C'(/)p^ 

x^Xp 

where  the  constant  C(f)  does  not  depend  on  p. 

Proof;  The  crucial  element  of  this  proof  is  that  the  coordinate  mapping  © 
maps  polynomials  of  degree  k  in  R^,  the  tangent  plane  at  ei  coordinatised 
by  X2,  ■  ‘  >  ,Xn+i,  to  polynomials  of  degree  k  on  the  sphere.  Since  /  o  ©  e 
C^{B~^Xp),  we  can  perform  the  multivariate  Taylor  series  expansion 

f  0&{x)  =  ^  CaX°  Rk{f,x), 

|a|<fc 

where  Rk{f,x)  is  the  remainder  satisfying 

Rk{f,x)  <  (7(/)(max  \e^\y)\)^ .  (12) 

yeXp  ^  ' 

Letting  6  =  ©(a;),  and  defining  the  degree  k  —  1  spherical  polynomial 

:=  x;  c»(0'‘(e))“  =  E 

|Q|<fe  |a|<r 

equations  (11)  and  (12)  tell  us  that 

max  \m  -  p,.,{e)\  <  C(/)(max  |e-'(!/)|)''. 

tfgAp  ytAp 

The  result  follows  because  maXy^Xp  |€)“^(2/)|  <  P-  ^ 


Approximation  on  Manifolds 


299 


Acknowledgments.  David  Ragozin’s  work  was  partially  supported  by  NSF 
Grant  DMS-9972004. 


References 


1.  Alfeld,  R,  M.  Neamtu,  and  L.  L.  Schumaker,  Fitting  scattered  data  on 
sphere-like  surfaces  using  spherical  splines,  J.  Comput.  Appl.  Math.  73 
(1996),  5-43. 

2.  Bos,  L.  and  S.  De  Marchi,  Limiting  values  under  scaling  of  the  Lebesgue 
function  for  polynomial  interpolation  on  spheres,  J.  Approx.  Theory  96 
(1999),  366-377. 

3.  Freeden  W.,  On  spherical  spline  approximation  and  interpolation.  Math. 
Meth.  in  Appl.  Sci.  3  (1981),  551-575. 

4.  von  Golitschek,  M.  and  W.  A.  Light,  Interpolation  by  polynomials  and 
radial  basis  functions  on  spheres,  Constr.  Approx.,  to  appear. 

5.  Jetter,  K.,  J.  Stockier,  and  J.  D.  Ward,  Error  estimates  for  scattered  data 
interpolation  on  spheres,  Math.  Comp.  68  (1999),  733-747. 

6.  Levesley,  J.,  W.  Light,  D.  L.  Ragozin,  and  X.  Sun,  A  simple  approach  to 
the  variational  theory  for  interpolation  on  spheres,  in  New  Developments 
in  Approximation  Theory^  M.  W.  Muller,  M.  D.  Buhmann,  D.  H.  Mache, 
and  M.  Felten  (eds.),  ISNM  132,  Birkhauser  (1999),  117-143. 

7.  Levesley  J.  and  D.  L.  Ragozin,  Radial  basis  interpolation  on  compact 
homogeneous  manifolds:  Convergence  rates,  preprint. 

8.  Light,  W.  A.  and  H.  Wayne,  Error  estimates  for  approximation  by  radial 
basis  functions,  in  Wavelet  Analysis  and  Approximation  Theory,  S.P. 
Singh  and  A.  Carbone  (eds.),  Kluwer  Academic,  Dordrecht,  1995,  215- 
246. 

9.  Madych,  W.  R.  and  S.  A.  Nelson,  Multivariate  interpolation  and  condi¬ 
tionally  positive  definite  functions  II,  Math.  Comp.  54  (1990),  211-230. 

10.  Ragozin,  D.  L.  and  J.  Levesley,  Zonal  kernels,  approximations,  and  pos¬ 
itive  definiteness  on  spheres  and  compact  homogeneous  spaces.  Surface 
Fitting  and  Multiresolution  Methods,  A.  Le  Mehaute,  C.  Rabut,  and  L.  L. 
Schumaker  (eds.),  Vanderbilt  University  Press,  Nashville,  1997,  143-150. 

11.  Ragozin,  D.  L.  and  J.  Levesley,  The  density  of  translates  of  zonal  functions 
on  compact  homogeneous  spaces,  J.  Approx.  Theory  103  (2000),  252-268. 

Jeremy  Levesley 
Department  of  Mathematics 
University  of  Leicester 
Leicester  LEI  7RH,  UK 
jll®mcs. le.ac.uk 


300 


J.  Levesley  and  D,  L.  Ragozin 


David  L.  Ragozin 
University  of  Washington 
Department  of  Mathematics 
Seattle,  WA  98195-4350,  USA 
ragQmath . Washington , edu 


Characterizations  of  Native  Spaces 


Lin-Tian  Luh 


Abstract.  In  the  theory  of  radial  basis  functions,  linear  combinations 
of  the  translates  of  a  single  function  $  are  used  as  interpolants.  The  space 
spanned  by  all  of  these  linear  combinations  carries  an  inner  product  defined 
via  $  itself.  It  can  be  completed  and  becomes  a  Hilbert  space,  called  the 
native  space  for  which  is  of  great  importance  for  further  investigation 
of  radial  basis  functions.  The  native  space  will  contain  abstract  elements 
which  are  not  linear  combinations  of  radial  basis  functions,  and  require 
some  work  to  be  recognized  as  functions.  This  paper  provides  some  char¬ 
acterizations  of  native  spaces  and  relates  some  of  the  different  approaches 
used  to  define  them.  Finally,  embedding  results  for  native  spaces  into 
Sobolev  spaces  are  proven. 


§1.  Introduction 

Our  goal  is  to  describe  properties  of  the  set  of  functions 

N 

xen,  CjGC,  (1) 

j=l 

where  is  a  subset  of  and  $  is  a  real-valued  symmetric  function  on 
Q.  X  Q.  These  functions  depend  on  sets  X  =  {a:i, . . . ,  xjv}  C  fl  of  iV  pairwise 
distinct  points  called  “centers”,  while  the  number  N  of  centers  and  their 
placement  within  0,  are  arbitrary.  Functions  of  the  form  (1)  arise  naturally  as 
tools  for  multivariate  approximation,  especially  if  $  is  a  radial  basis  function 
^{x,y)  :=  <^(||a:-y|l2)  with  a  real-valued  function  (f)  on  [0,  oo).  We  shall  study 
the  closure  of  the  linear  span  of  functions  (1)  under  a  natural  topology  that 
comes  from  $  itself,  provided  that  $  has  a  crucial  property; 

Definition  1.  A  function  $  €  (7(11  x  li)  is  called  conditionally  positive  definite 
(abbreviated  as  c.p.d.j  of  order  m  on  Q  if  the  quadratic  form 

N 

^  ^  Cj  Cfe  $ (xj  ,  Xfc  ) 
j,Jb=l 
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is  positive  for  all  sets  X  =  {xi, . . . ,  x;v}  C  of  N  pairwise  distinct  points 
and  all  vectors  c  =  (ci, . . . , cat)^  €  C'^  \  {0}  satisfying 

N 

;^Cip(xi)  =  0fora/ip6F^,  (2) 

i  =  l 

where  is  the  space  of  d-variate  complex-valued  polynomials  of  order  not 
exceeding  m. 

There  are  various  possibilities  to  proceed  from  here.  Already  in  their  early 
pioneering  papers,  Madych  and  Nelson  already  took  two  different  approaches, 
via  finitely  supported  functionals  [3]  and  via  a  specific  version  of  generalized 
Fourier  transforms  [4]  in  the  spirit  of  Gelfand  Shilov.  The  latter  requires 
measure-theoretic  arguments  at  certain  places,  and  is  rather  complicated  to 
deal  with.  The  dissertation  of  Iske  [1]  used  variational  inequalities,  while 
Weinrich  [6]  proceeded  via  regularized  distributions  in  the  sense  of  Schwartz. 
Our  goal  here  is  to  show,  as  far  as  possible,  the  equivalence  of  the  cited 
approaches.  Since  the  access  via  generalized  Fourier  transforms  has  problems 
in  dealing  with  arbitrary  domains  ft  C  we  proceed  as  in  [5]  in  order  to 
start  with  the  most  general  approach  known  so  far. 

§2.  Construction  via  Finitely  Supported  Functionals 

Consider  the  space 

{P^)q  :=  Ci  G  C,  G  for  l<i<  N  with  (2)| 

of  all  functionals  that  are  finitely  supported  in  and  vanish  on  the  polynomials 
in  P^.  Starting  with  a  c.p.d.  function  $  of  order  m  in  fl,  we  define 

N  M 

:=  ^  ^  ^  ^  Vj) 

i=i  i=i 

for  X,n  €  {Pi)^  with  A  =  ge*  '""ef 

product  (*,  •)^  which  induces  a  ^-dependent  norm  in  the  ^-independent  space 
(P^)q.  To  relate  functionals  with  functions,  we  use  the  map 

:  (Pi)h  -  C'(n),  R^(\)  :=  A"$(x,  ■)  =:  A  * 

where  A^  stands  for  the  action  of  A  with  respect  to  the  variable  x.  By  standard 
Hilbert  space  arguments,  the  fundamental  identity 


=  {X,li)i  for  all  A,m  €  (Pi)h. 


(3) 
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proves  that  image  under  form  a  dual  pair.  Furthermore, 

this  equation  carries  over  to  the  Hilbert  space  closures  of  and 

P$,n  of  R^{{P^)^),  respectively. 

This  construction  is  simple,  but  it  leads  to  rather  abstract  elements  in¬ 
stead  of  classical  functionals  and  functions.  To  overcome  this  problem,  one 
assumes  that  and  ft  allow  a  Lagrange-type  basis  with  m'  = 

dimP^  and  points  a;i, . . .  ,Xm'  ^  ^  such  that  li{xj)  =  8ij  for  1  <  i,j  <  m'. 

Then  the  functional  6(^x)  •=  ~  li{x)8xi  lies  in  (Pm)h^ 

with 

S^{fi){x)  :=  =  fJ'{R^{^{x)))  for  all  p  G  P$,n,a:  G  O 

uses  (3)  to  define  a  classical  function  for  each  abstract  element  p  G  P$,n. 

The  space  :=  «5$((P,^)q)  now  is  a  much  more  concrete  space.  The  first 
of  our  results  can  be  found  in  [2]  with  full  proofs. 

Theorem  2.  The  spaces  :=  R^(P^,n)  and  :=  5$(P$,f2)  are  iso- 
metncally  isomorphic  via  the  mapping  5$  o  and  the  inner  product  it 
introduces  on  Furthermore, 

/x(P$(A))  =  (A,/i)$=//(P$(A))  (4) 

holds  for  all  functionals  in  (P^Jq  and  its  closure  P$,n. 

It  is  not  straightforwardly  possible  to  associate  classical  function  values 
to  the  elements  P$(A).  But  (4)  indicates  that  P$(A)  and  5$(A)  should  agree 
up  to  a  polynomial  from  P^  on  fl.  The  function  S^(A),  however,  vanishes  on 
the  points  we  used  for  the  Lagrange  interpolation  in  P^,  and  thus  realizes  a 
very  special  assignment  of  function  values  modulo  P^.  Thus,  we  can  interpret 
P^>(A)  as  an  equivalence  class  of  functions  mod  P^  on  ft,  one  representer  of 
which  is  5$ (A).  Thus  we  should  add  P:^  to  the  spaces  we  dealt  with  so  far. 

Definition  3.  Let  $  be  c.p.d.  of  order  m>0  in  fl.  Then  the  direct  sum 

N^{n)  ■.=  Pi{n)®Gi,a 

is  called  the  native  space  of  $. 

The  above  construction  allows  us  to  define  a  semi-inner  product  (.,.)$  on 
this  space  such  that  the  nullspace  is  P^.  Theorem  2  now  implies  the  isometric 
isomorphisms  N^{Q,)  ~  PmW  ®  N^{fl)  ^  P^i^)  ® 

characterizations  of  the  native  space.  We  add  two  others,  with  proofs  in  [2] 
dating  partially  back  to  [3]; 

Theorem  4.  Assume  fl  is  a  subset  of  IR'^  and  m  >  0.  Then  is  the 

unique  subspace  of  (7(fl)  with  a  semi-inner  product  (•,  •)$  satisfying 

(a)  the  null-space  of  the  semi- norm  is  P^(Q), 

(b)  N^(fl)/P^(Q)  is  a  Hilbert  space, 

(c)  if  p  e  (Pm)Q^  fhen  p  *  ^  e  and  (p  *  ^,f)^  =  p{f)  for  all  /  G 

N^{a). 
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Theorem  5.  Fix  m  >  0  and  a  c.p.d.  function  $  of  order  m  in  fl.  Then  a 
complex-valued  function  f  on  ft  is  in  A^$(n)  iff  there  is  a  constant  c{f)  such 
that 

Im(/)I  <  c(/)||/i||4. 

for  all  p  in  (P^)q.  The  smallest  possible  constant  for  such  f  is  the  seminorm 

ll/lk- 

The  following  sections  will  proceed  gradually  from  here  to  other  char¬ 
acterizations  of  native  spaces.  The  main  guideline  is  the  various  forms  that 
functionals  can  take,  starting  from  finitely  supported  functionals  used  in  this 
section.  We  proceed  via  measures  (finitely  or  compactly  supported)  to  distri¬ 
butions,  and  we  refer  the  reader  to  [2]  for  full  proofs. 

§3.  Construction  via  Measures 

Definition  6.  The  family  of  all  hnitely  supported  measures  on  H  is  denoted 
by  M{n). 

Theorem  7.  Let  m  be  a  nonnegative  integer.  Assume  $  is  positive  definite 
in  with  the  following  property:  for  all  X  £  M{fl)  and  s  >  0,  there  exists 
in  satisfying 

ll^e  -  ^Ik  <  £• 

Then  (Pm)a  ^s  contained  in  and  M(Q)  is  isometrically  isomorphic  to 

a  dense  subset  of  iPm)h'  Furthermore,  we  have 

N^{n)  ^  p^{n) 

where  the  closure  is  induced  by  4>.  The  inner  product  on  M{Ll)  is  defined  as 
(X,p)^  :=  X(p  ♦  $). 


Now  we  introduce  a  new  space  ((F,^)^)  consisting  of  all  compactly  sup¬ 
ported  measures  p  on  Q  with  vanishing  moments  for  P^,  i.e.,  all  integrals  of 
polynomials  from  P^  with  respect  to  p  are  zero.  If  we  assume 


/  /  ^x,y)diy{y)dp{x)  =  /  /  ^x,y)dp{x)diy{y) 
Jn  Jn  Jn  Jn 


for  all  p,iy  in  and  _ 

u{iy  *  $)  >  0 

for  all  nonzero  u,  it  is  easily  checked  that 


{u,p)  :=  u{p*^) 

forms  an  inner  product  on  {{P^)q).  Then  we  have  the  following  theorem: 


Native  Spaces 


305 


Theorem  8.  Under  the  above  assumptions,  {(jP^)o)  isometrically  iso¬ 
morphic  to  Furthermore,  the  native  space  N^(Q)  is  equivalent  to 


The  proof  of  Theorem  8  in  [2]  is  quite  hard.  It  involves  weak*  topology 
and  the  Krein-Milman  theorem.  So  far,  Theorem  8  is  the  best  result  concern¬ 
ing  interpretation  of  the  dual  of  the  native  space  as  a  space  of  measures. 


§4.  Construction  via  Tempered  Test  Functions 

Starting  from  [4]  there  is  an  approach  to  native  spaces  via  generalized  Fourier 
transforms  in  the  sense  of  Gelfand  and  Shilov.  Here,  we  want  to  avoid  distribu¬ 
tions  and  generalized  Fourier  transforms  as  far  as  possible.  The  key  point  is  to 
use  variational  equations  on  spaces  of  tempered  test  functions  as  a  convenient 
substitute  for  generalized  Fourier  transforms. 

Let  <5(n)  denote  the  space  of  tempered  test  functions  in  the  sense  of 
Laurent  Schwartz  with  supports  contained  in  ft,  and  define  as  the 

space  of  tempered  test  functions  with  support  in  ft  and  vanishing  moments 
up  to  order  m.  For  all  v,w  £ 


{v,w)^:=  /  /  ^{x,y)v{x)w{y)dxdy 

Jn  Jn 

is  a  bilinear  form,  and  we  would  like  to  base  a  second  construction  of  the 
native  space  on  it.  To  this  end,  it  would  be  a  reasonable  possibility  to  define  a 
property  like  “tempered  conditional  positive  definiteness”  to  require  that  this 
form  is  positive  definite  on  The  result  would  be  a  different  theory, 

but  we  want  to  blend  this  approach  into  our  previous  setting.  Thus  we  look 
at  conditions  that  allow  to  relate  this  bilinear  form  to  the  earlier  one. 

Following  [1],  we  assume  a  continuous  positive  function  :  1R‘^\{0}  — 3R 
exists  such  that 

{v,w)^  =  {2'k)~^  ip{x)v{x)w{x)dx  (5) 

JR^ 


for  all  G  <S^(0).  Here 


v{w)-.=  f 

Jn 

denotes  the  classical  Fourier  transform  of  u.  By  approximation  of  functionals 
from  (Pm)^  by  regular  distributions  generated  by  functions  from  Iske 

[1]  proved  that  this  assumption  is  slightly  stronger  than  c.p.d.  of  $  on  IR  , 
and  that  =  (u,  holds  for  all  v,w  ^  S^{ft). 
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Definition  9.  If  a  function  $  of  difference  form  =  0(x  —  y)  with 

a  continuous  and  even  function  (j)  on  satisfies  (5),  we  call  $  variationally 
positive  definite  (v.p.d)  of  order  m>0  on 

Definition  10.  Let  be  v.p.d.  of  order  m  >  0  on  A  complex-valued 
function  f  is  in  the  space  if  and  only  if  f  6  C(fl)  and  there  exists  a 

constant  c{f)  such  that 

I  f  f{x)v(x)dx\  <  c{/){  /  /  ^{x,y)v{x)v{y)dxdyy^^  for  allv  e 

Jn  Jn  Jn 

Theorem  11.  Let  Q,  be  open  and  $  be  v.p.d.  of  order  m  >  0  in  IR^.  Then 
N^(Q)  =  Furthermore,  is  isometrically  isomorphic  to 

Theorem  11  provides  a  nice  unification  of  the  theories  of  Weinrich  [6]  and 
Iske  [1].  Their  work  is  based  on  (-P^)n  and  respectively.  The  proof  of 

Theorem  11  is  rather  involved  [2]. 


§5.  Embedding  Theorems 

We  now  construct  continuous  embeddings  of  native  spaces  into  well-known 
spaces.  Madych  and  Nelson’s  discovery  that  A^$(1R'^)  C  C'(1R‘^)  can  be  re¬ 
garded  as  the  first  step  towards  embedding  theorems,  but  it  was  just  an  in¬ 
clusion  result.  In  this  paper,  all  the  embedding  theorems  concern  continuous 
embeddings  with  respect  to  the  topologies  of  the  spaces.  Even  the  embeddings 
of  native  spaces  into  L2  spaces  can  be  nontrivial,  provided  that  the  underlying 
domains  are  unbounded  (see  [5]  for  the  bounded  case). 

In  this  section  we  first  assume  $  to  be  v.p.d.  of  order  0  on  JR.^  with  a 
positive  classical  Fourier  transform  ip  G  Li{M^)  of  <p  with  ^{x,y)  =  4>{x  —  y). 
All  functions  /  of  the  form  (1)  have  a  classical  Fourier  transform 

j=l 

and  there  is  an  isometry  B  :  R^{{Po)q)  f ! \fp  mapping 

these  functions  into  Z/2(1R^)-  It  is  now  easy  to  see  that  the  equation  /  = 
holds  for  all  functions  in  R^{{Po)q)  and  its  closure  which  can 
be  identified  with  the  native  space  of 

Theorem  12.  For  variationally  positive  dehnite  functions  on  of  order 
zero  with  a  positive  Li  Fourier  transform  p,  the  functions  in  the  native  space 
of  $  have  Fourier  transforms  of  the  form  yJTp  •  g  with  an  L2  function  g.  The 
native  space  for  $  can  be  continuously  embedded  [2]  into  L2(M.^). 

The  last  statement  was  generalized  in  [2]  to 
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Theorem  13.  Let  $  be  symmetric  and  translation-invariant  on  Lt  x  Q,  and 
c.p.d.  of  order  m  >  0  on  a  domain  C  containing  points 
which  uniquely  determine  polynomials  of  If  there  exists  a  positive 

continuous  g  G  which  decays  exponentially  at  inhnity  and  satisfies 


I  \p{x)(l){x)\g{x)dx  <  00 

Jn 

for  all  p{x)  e  -P^(n),  then  can  be  continuously  embedded  in 

Theorem  12  characterizes  native  spaces  as  spaces  of  functions  whose 
Fourier  transforms  lie  in  a  weighted  L2  space.  The  same  holds  for  Sobolev 
spaces  on  and  this  similarity  can  be  used  to  derive  theorems  for  embed¬ 
ding  of  native  spaces  into  global  Sobolev  spaces  on  IR^.  For  embeddings  of 
local  native  spaces  on  domains  fl  C  we  refer  the  reader  to  the  fact  (proven 
in  [2]  and  [5])  that  functions  in  native  spaces  always  have  an  extension  to  the 
largest  domain  where  $  has  the  c.p.d.  property.  This  yields  embeddings  of 
local  native  spaces  into  spaces  of  restrictions  of  global  Sobolev  spaces  for  glob¬ 
ally  defined  functions  but  the  case  of  purely  locally  defined  $  is  unsolved. 

If  $  is  v.p.d.of  positive  order  m  on  R^^,  the  function  (p  of  (5)  will  have  a 
singularity  at  zero,  and  thus  the  notion  of  Fourier  transforms  needs  generaliza¬ 
tion.  We  simply  view  (5)  as  a  variational  property  satisfied  by  the  generalized 
Fourier  transform  ^  of  and  we  want  to  prove 

Theorem  14.  For  v.p.d.  functions  $  on  R^  of  order  m  >  0  jtlie  functions  f 
in  the  native  space  of  $  have  generalized  Fourier  transforms  f  —  •  g  with 

an  L2  function  g,  where  the  generalized  Fourier  transform  of  f  is  defined  via 
the  variational  property 

J  f  .  w  —  J  f  -w  for  all  w  G  <S^(n). 


Proof;  We  take  two  functions  u,  w  G  and  form  the  function  /„ 

Then  (5)  yields 

J  fy  =  (27r)"‘^  j  ipwv 

=  {2Tr)~'^  j  wy^y/ipv  (6) 

=  i2TT)~^  j  Wy/ipB{fy), 


if  we  define  B{fy)  :=  G  F2(R‘^).  This  maps  isometrically  into  L2(R'^), 
because  the  canonical  inner  product  of  such  functions  is 


ifuJv) 


$  = 


ipuv. 


Now  (6)  carries  over  to  the  closure,  i.e.  the  native  space,  and  it  yields  the 
desired  result.  □ 
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Review  of  Some  Approximation  Operators  for 
the  Numerical  Analysis  of  Spectral  Methods 


Yvon  Maday 


Abstract.  This  paper  reviews  some  operators  that  are  used  in  the  nu¬ 
merical  analysis  of  spectral  and  spectral  element  methods.  We  motivate 
the  introduction  of  these  different  operators  and  sketch  their  approxima¬ 
tion  properties.  Finally,  we  apply  them  to  derive  optimal  error  estimates 
for  spectral  type  approximations  of  the  solution  of  elliptic  partial  differen¬ 
tial  equations. 


§1.  Introduction 

Spectral  type  methods  are  high  order  discretizations  that  allow  to  compute 
approximate  solutions  of  partial  differential  equations.  The  recent  version  of 
spectral  approximations  is  based  on  the  Galerkin  approach  where  the  varia¬ 
tional  statement  (equivalent  to  the  strong  formulation  of  the  PDE)  is  set  on 
discrete  spaces  of  test  and  trial  functions.  For  instance,  let  us  consider  the 
problem:  Snd  u  e  X  such  that 

a(u,v)  =  (f,v),  VveX,  (1) 

where  X  is  some  Hilbert  space,  and  a  is  a  continuous  bilinear  form  over  X. 
The  general  Galerkin  approximation  of  this  problem  first  requires  the  choice 
of  a  family  of  discrete  spaces  X^  C  X,  where  iV  is  a  parameter  that  tends  to 
infinity  and  is  related  to  the  dimension  of  the  discrete  space  Xn-  The  discrete 
problem  is  then  stated  as  follows;  find  un  E  Xn  such  that 


a{uN,VN)  -  {f,VN),  \fvN  e  Xjv.  (2) 

The  basic  general  hypothesis  that  makes  problem  (1)  well-posed  is  that  a  is 
continuous  and  a-elliptic  over  X  (i.e.  3  a  >  0  such  that  o(u,u)  >  Ck:|lu||^  for 
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all  It  e  X.  These  properties  remain  true  over  each  (since  Xj^  C  X);  thus 
(2)  is  also  well-posed  for  each  N.  In  addition,  the  solution  satisfies 

<  c  inf  ||u-ujv|U-  (3) 

The  constant  c  that  appears  in  (3)  is  the  quotient  of  the  continuity  constant 
of  a  with  the  ellipticipty  constant  a  and  is  thus  independent  of  Xjv- 

Going  back  to  spectral  methods,  the  definition  of  Xjv  involves  polyno¬ 
mials,  and  in  the  most  simple  cases  (we  shall  see  more  general  examples  in 
Section  5)  we  have  =  X  fl  where  P;v  represents  the  set  of  all  poly¬ 
nomials  of  (partial)  degree  less  than  or  equal  to  N.  Here  N  is  the  parameter 
responsible  for  the  convergence  of  the  method.  Due  to  (3),  one  ingredient  in 
the  numerical  analysis  of  the  spectral  method  is  the  approximation  properties 
of  the  space  of  polynomials  for  given  functions.  The  classical  analysis  of  the 
approximation  properties  of  polynomials  is  done  in  terms  of  L°°-norms.  This 
is  not  completely  appropriate  for  our  purpose  since  most  often  X  is  a  Hilbert 
space  (generally  or  spaces),  and  the  approximation  properties  have  to 
be  measured  with  these  norms.  If  a  rate  of  convergence  (with  respect  to  N) 
on  the  best  fit  inft;^^^/,/  is  sought  after,  some  regularity  has  to  be 

assumed  over  u.  In  Section  2,  we  give  a  survey  of  these  best  approximation 
results  depending  on  the  regularity  of  the  function  we  want  to  approximate. 
We  first  analyze  the  L^-best  fit  and  then  the  H^-hest  fit.  The  main  ingredient 
in  this  analysis  relies  on  the  Legendre  basis  that  is  composed  of  the  orthogo¬ 
nal  polynomials  for  the  standard  Lebesgue  mesure  over  the  interval  (— 1,  -f  1), 
These  polynomials,  denoted  as  (Ln)„,  are  defined  by;  degree(L„)  =  n, 

LnW  =  1,  (4) 

(5) 

They  satisfy  some  standard  properties  (actually  valid  for  most  families  of 
orthogonal  polynomials) 


A{L„)  =  -^({l-e)^  =  n{n  +  l)L„,  (6) 

that  one  can  translate  by  saying  that  the  Legendre  polynomials  are  the  eigen¬ 
vectors  of  the  (Sturm-Liouville)  operator  A.  Since  this  is  a  possible  basis  set 
for  the  implementation  of  problem  (2),  this  gives  the  name  of  spectral  to  the 
methods  we  shall  consider  hereafter,  and  that  have  been  first  analyzed  in  [10]. 
We  refer  also  to  [6]  and  [3]  for  more  recent  surveys  on  the  numerical  analysis 
of  these  methods.  In  Section  3,  we  introduce  the  notion  of  numerical  integra¬ 
tion  and  the  interpolation  operator,  two  notions  that  are  naturally  quite  close 
and  that  allow  to  transform  the  “theoretical”  approximation  method  into  a 
“applicable”  one.  In  Section  4,  motivated  by  the  analysis  of  the  Stokes  prob¬ 
lem,  we  introduce  a  new  operator,  that,  in  opposition  to  the  previous  ones,  is 
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uniformly  stable  (in  N)  both  in  the  L^-norm  and  the  i?^-norm  and  possess 
optimal  approximation  properties.  It  has  to  be  said,  beforehand,  that  in  the 
precise  analysis  of  these  spectral  (or  polynomial)  approximation,  the  Bern¬ 
stein  inequality  runs  counter  to  most  standard  tools  that  generally  allow  for 
deriving  approximation  results  for  a  new  operator  from  an  already  analyzed 
one.  This  Bernstein  inequality  tells  about  the  equivalence  of  norms  on  the 
finite  dimensional  linear  space  of  polynomials.  It  is  well  known  that,  for  any 
function  in  the  norm  is  smaller  than  the  iif^-norm;  of  course  this  is 
true  in  particular  on  polynomials: 

V0A/-  G  PiV,  ||0iv||L2  < 

Since  all  norms  are  equivalent  on  P^v,  there  exists  a  constant  (obviously  de¬ 
pending  on  iV)  such  that 

V(/)jv  £  Pw,  <  c(iV)||0iv|U2- 

The  behaviour  of  this  constant  is  made  precise  by  the  Bernstein  inequality 

V<^iv  e  Pn,  \\(f>N\\m  <  cN^\\4>n\\l^^ 

where  c  no  longer  depends  on  N.  This  estimate  is  optimal  (in  the  sense  that 
there  exists  a  sequence  of  polynomials  such  that  the  ratio  of  the  iJ^-norm  over 
the  I/^-norm  scales  like  0{N^))y  but  is  bad  as  regards  the  ratio  of  convergence 
rate  between  the  F^-best  fit  and  the  L^-best  fit  that  scales  like  as 

we  shall  see  below. 

In  the  first  three  sections,  the  domains  where  the  functions  live  will  be 
very  simple,  actually  too  simple  to  tackle  real  life  problems;  indeed  these  are 
bricks  equal  to  (-1, 1)*^  where  d  =  1,2  or  3.  The  generalization  of  spectral 
methods  to  more  complex  geometries  is  done  by  combining  two  key  ingredi¬ 
ents:  the  mapping  of  bricks  onto  curved  bricks  through  regular  mappings,  and 
domain  decomposition.  We  give  some  hints  about  this  generalization  in  §5. 


§2.  Hilbert  Type  Projection  Operators 

Let  us  start  with  the  one-dimensional  case.  In  L^(— 1, 1),  we  consider  the  set 
Pjv(-l,l)  of  all  polynomials  of  degree  <  N.  From  the  Weierstrass  density 
theorem,  we  know  that  any  element  (j)  in  L^(— 1, 1)  can  be  written  as 


'A(C)  =  i;rMC), 

n=0 


(7) 


where  the  convergence  of  the  series  holds  in  L^.  The  coefficients  can  be 
derived  from  0  thanks  to  the  orthogonality  of  the  Legendre  basis  as  follows: 


2 


312 


Y.  Maday 


Next,  from  (6)  we  derive  that 


f 

7-1  n 


ALniO 


dC, 


i{n  +  1) 

noticing  that  A  is  symmetric,  and  assuming  0  regular  enough,  we  derive  that 
2ti  -f- 


If  we  iterate  this  argument  p  times,  we  obtain 


2n  +  l 


so  that,  the  following  simple  relation  holds  between  the  Legendre  coefficients 
of  (f)  and  of 

Next,  let  TTiv  denote  the  L^(-l,  l)-projection  over  F;v(-1,  !)•  Going  back  to 
(6),  we  deduce  from  (7)  and  (5)  that 


N 


(8) 


n=0 


SO  that 


oo  oo  ^ 

E  ^"in(0=  E  — 4n^)"£n(C) 

n=N+l  n=N'+l  ‘  / 

and,  by  Parseval 

n=N+l 

Slsl*  I  Wl'i’sVi 

1  °°  9  1 


n=0 


We  have  thus  proven  that,  for  any  (j)  in  the  domain  V[A^]  of  A^, 

U  -  <  c(p)N-^'’U’'mLH-x,x)- 

It  is  easy  to  check  that  1, 1)  C  hence  the  following  theorem  (due 

to  Canuto  and  Quarteroni  [7]),  proven  here  for  even  values  of  r,  holds  for  any 
r  thanks  to  an  argument  of  interpolation  between  Sobolev  spaces: 
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Theorem  1.  For  any  real  number  r  >  0,  there  exists  a  constant  c  >  0, 
depending  only  on  r  such  that,  for  any  function  (f)  e 

110  -  7rAr(0)||L2(_i,i)  <  ciV”^l|0||i/r(_i,i).  (9) 

Let  us  denote  now  by  the  L^((— 1, 1)^^)  orthogonal  projection  operator 

over  the  set  Piv((-l,l)^)  of  all  polynomials  of  degree  <  N  with  respect 
to  each  variable.  By  Pubini’s  theorem,  IIjv  =  Trjv  <S>  ttat,  in  2D  and  11  = 

TTAT  (8)  ottn  (2)  oTTiv,  in  3D.  By  tensorizing  (9)  we  derive 

Theorem  2.  For  any  real  number  r  >  0,  there  exists  a  constant  c  >  0, 
depending  only  on  r  such  that,  for  any  function  0  G  1}^}, 

110  -  nAr(0)|lx,2((_i,l)i)  <  ciV“^||0||H^((_l,l)d). 

We  are  now  in  a  position  to  tackle  the  approximation  in  the  norms. 
First,  we  consider  a  function  0  €  1),  with  r  >  1,  It  is  quite 

immediate  to  check  that  the  polynomial  0jv(C)  =  /f  i belongs 
to  Fa/^(-1,  1),  vanishes  at  C  =  -1,  and  satisfies 

0iv(l)  =  J  -  J 

=  -  .^(-1)  =  0, 

and  hence  is  an  element  of  1)  D  1).  Finally  it  is  a  good  ap¬ 

proximation  of  0,  since  from  Poincarre’s  inequality  and  Theorem  1, 

||<A  -  Iji'ivIlHlC-l,!)  ^  ‘=11^  “ 

<c||g-7rjv-i(^)lU=(-M) 

<  II  <  cA^^~^||0||Hr(_l,l). 

Let  us  introduce  now  the  orthogonal  projection  operator  from  F^(-l,  1) 
onto  Pn(-1,  1)  n  1)5  we  can  state  the  following  result  (due  to  Maday 

and  Quarteroni  [15]): 

Theorem  3.  For  any  real  number  r  >  1  and  any  real  number  0  <  s  <  1, 
there  exists  a  constant  c  >  0,  depending  only  on  r  and  s  such  that  for  any 
function  <j>  G  Hq{—1,  1)  0  1), 

110  -  7r];)“(0)IUM-U)  ^  ciV''“ni0llHn-bi)- 

Proof;  The  theorem  has  been  obtained  for  s  =  1.  For  s  =  0  it  is  obtained 
through  a  standard  Aubin-Nitsche  duality  argument,  and  then  for  any  5  by 
interpolation  between  Sobolev  spaces.  □ 
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Remark.  At  this  point  it  has  to  be  said  that  the  projection  operator  ttat 
does  not  have  optimal  approximation  properties  in  the  H^-norm,  the  only 
(non-improvable)  property  that  can  be  obtained  is 


110  -  7riv(0)||//i(_i,i)  <cN^ 

We  refer  to  [3]  for  details  and  counter-examples. 

Remark.  It  may  also  be  interesting  to  note  that,  despite  their  definition,  the 
previous  operators  have  stability  properties  in  various  norms.  First  for  the 
L^-operator,  we  have 


which  is  related  to  what  we  have  indicated  in  the  previous  remark,  but  also 

which  is  rather  suprising  since,  from  this  (non-uniform)  stability  property,  the 
/fo -projection  operator  can  be  extended  to  (irregular)  functions  of  L^!! 

Again  by  tensorization  of  the  results  of  the  one  dimensional  Theorem  3, 
we  exhibit  a  polynomial  that  approximates  regular  functions  in  1)^) 

well,  from  which  we  derive  approximation  properties  on  the  multidimensional 
projection  operator  from  R^((-l,  l)*^)  over  Piv((-1, 1)^)  H  i^o  ((-1, 1)^) 
both  in  /f^-norm  and  in  L^-norm  (derived  by  duality): 

Theorem  4.  For  any  real  number  r  >  1  and  any  real  number  0  <  s  <  I, 
there  exists  a  constant  c  >  0,  depending  only  on  r  and  s  such  that,  for  any 
function  (j)  G  l)'^)  fl  i/^((-l,  1)^^), 


110  “  nj^°(<;6)||//5((_i^i)d)  <  cTV®  '^||0||^r-((_i^i)<i).  (11) 

These  results  can  be  completed  in  order  to  derive  a  whole  scale  of  ap¬ 
proximation  projectors  in  higher  order  norms.  These  are  required,  e.g.  for 
the  analysis  of  the  approximation  of  fourth-order  problems.  The  general  re¬ 
sult,  concerning  the  orthogonal  projection  operator  1)*^  D 

Hq  ((~1,  l)'^  onto  Pjv((— 1,  l)'^)  n  Hq  ((—1, 1)*^)  is  given  in  the  following  theo¬ 
rem  (due  to  Maday  [11]  in  ID,  see  also  [3]  for  the  extension  to  2  and  3D): 

Theorem  5.  For  any  real  number  0  <  a  <  p  and  any  0  <  s  <  p  <  r,  there 
exists  a  constant  c  >  0,  depending  only  on  r,  5,  p,  a  such  that,  for  any  function 


||i^  -  <cN^  ’^II0IIh'^((-i, !)■')• 
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Remark.  A  final  remark  on  these  operators  is  that  improved-approximation 
results  in  negative  norms  are  also  true,  and  can  be  obtained  in  a  classical  way, 
by  further  refering  to  the  Aubin-Nitsche  duality  argument.  Hence,  Theorem 
5  is  also  valid  for  negative  values  of  s. 

These  results  allow  us  to  prove  that  the  approximation  of  most  elliptic 
variational  problems  by  spectral  methods  is  optimal.  As  an  example,  let  us 
consider  the  (non-constant)  Laplace  problem  on  a  cube  Q,  =  (-1,1)^:  given 
a  3  X  3  matrix,  symmetric  and  uniformly  positive  definite,  we  consider  the 
problem  of  finding  u  G  Hq  (O)  such  that 

— div[Agrad]tt  —  f.  (12) 

The  approximation  then  consists  in  finding  an  element  un  in  Xn  =  Fjv(^^)  H 
Hq{Q)  such  that 


/  AVunXvn  =  /  fvN,  Vuiv  ^  A'iv.  (13) 

Jq  Jn 

Assuming  that  u  G  we  deduce  from  (3)  and  (11)  that 

ll'ii  - 

As  hinted  in  the  introduction,  this  problem  is  numerically  intractable; 
indeed  the  implementation  of  (13)  requires  the  computation  of  the  two  in¬ 
tegrals  appearing  on  the  left-  and  the  right-hand  sides  of  this  equation.  The 
exact  computation  is  most  often  impossible,  and  certainly  numerically  not  fast 
enough.  The  use  of  numerical  integration  rules  is  the  cure  to  this  problem, 
but  in  order  to  combine  efficiency  and  precision,  following  Gottlieb  [9]  and 
Mercier  [17],  we  refer  to  the  use  of  Gauss  type  quadrature  rule.  Indeed,  they 
are  well  known  to  be  well  suited  for  the  integration  of  polynomials. 

§3.  Interpolation  Operators 

Between  the  different  numerical  quadrature  rules  over  (-1, 1),  well  suited  for 
polynomial  integration,  we  shall  quote  here  the  Legendre- Gauss  and  Legendre- 
Gauss-Lobatto  ones.  We  refer  to  [2]  for  more  details.  For  the  sake  of  com¬ 
pleteness,  we  recall  the  definition  of  these  formulae: 

Theorem  6.  (Gauss  formula)  For  any  real  number  n,  there  exists  a  unique 
set  of  points  —  1  <  Cf  <  C2  <  '  ‘  ‘  <  Cn  <  ^  unique  set  of  positive 

weights  such  that  for  any  polynomial  (j)  G  P2n-i(-l,l)j  the  following 
equality  holds: 
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Theorem  7.  (Gauss-Lobatto  formula)  For  any  real  number  n,  there  exists 
a  unique  set  of  points  -1  =  <  Cf  <  ^  unique  set 

of  positive  weights  p”  such  that  for  any  polynomial  (}>  G  P2rt-i(-lil)>  the 
following  equality  holds: 

/%{ck  =  E^K"K. 

‘'-1  i=0 

Prom  now  on,  we  shall  assume  that  the  degree  of  the  polynomials  for  the 
approximation  is  fixed  to  be  N,  and  we  shall  use  +  1  points  either  of  Gauss 
or  Gauss-Lobatto  type.  For  the  sake  of  simplicity,  these  points  will  be  denoted 
with  no  superscript,  i.e.  in  all  of  what  follows,  we  set  G  =  and 
We  recall  that  these  points  are  the  roots  (resp.  the  extrema)  of  the  Legendre 
polynomials;  more  precisely,  we  have 

Vz,  LN+i(Ci)  =  0,  (resp.  (1  -  =  0). 

After  tensorization,  these  one  dimensional  quadrature  rules  easily  provide 
quadrature  rules  on  the  square  and  on  the  cube  defined  as  follows  (e.g,  in 
2D  for  the  Gauss  Lobatto  formula): 

N  N 

GL  i=o  i=0 

The  problem  that  is  actually  implemented  is  then  the  following:  find  an  ele¬ 
ment  U]\f  in  Xn  such  that 

AVunS/v^  =  ^vn^Xn.  (14) 

GL  GL 

Even  in  the  case  where  A  is  constant,  at  least  in  more  than  one  dimension, 
the  left-hand  side  is  not  exactly  computed.  The  problem  is  no  longer  of  the 
form  (1),  and  the  abstract  theory  has  to  be  generalized  in  order  to  handle  this 
problem  as  well. 

Here  is  not  the  place  to  detail  this  generalization  (see  [3],  where  the 
complete  analysis  is  performed)  but  it  is  natural  that  the  a-ellipticity  of  the 
bilinear  form  on  the  left-hand  side  of  (14)  is  again  one  of  the  key  ingredients 
and  has  to  be  satisfied.  This  follows  from  the  property,  proven  in  [7] 

€  PAr(-l,  1),  51 A  >  f  4>%{C)d(:. 

GL 

Prom  this  property  it  can  be  easily  derived  that  the  solution  to  (14)  exists 
and  is  unique. 

The  approximation  properties  of  the  polynomial  interpolation  operator 
over  the  Gauss-Lobatto  nodes  is  of  great  importance  in  the  error  bounds.  Let 
zat  denote  this  operator  in  one  dimension: 

G  C°([-l,l]),  Ziv(0)  €  Pa^(-1,1)  and  Vz,0  <  z  < 

and  let  us  tensorize  it  in  order  to  get  a  two  (resp.  a  three)  dimensional  operator 
In  =  in  ®  in  (resp.  In  =  in®  in®  «n)-  The  properties  of  this  operator  have 
been  established  in  [12]  and  [2],  and  read  as  follows: 
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Theorem  8.  For  any  real  numbers  s  and  r  satisfying  r  >  (d  -h  s)/2  and 
0  <  5  <  1,  there  exists  a  positive  constant  c  depending  only  on  r  such  that 
for  any  </>  in  1, 1)'^)  the  following  estimate  holds 

||(/)  -  Jjv(</')||H«((-l,l)d)  <  (15) 

It  has  to  be  noticed  that  this  operator  requires  more  regularity  than  the 
projection  operator,  but  it  is  optimal  both  in  the  and  the  norms. 
It  has  also  to  be  recalled  that  in  the  classical  approximation  properties  in  the 
L°°  norm,  the  Lebesgue  constant  appears  as  a  pollution  of  the  approximation 
properties  of  the  interpolation  operator  as  regards  the  optimality  provided 
by  the  corresponding  best  fit.  This  is  not  the  case  in  the  L^-norm.  In  this 
direction  what  we  have  more  precisely  is  that,  for  any  function  </>  in  1), 


and  for  any  function  </>  in  1,1), 

Another  nice  property  of  this  operator,  that  has  some  importance  for  nonlinear 
PDE’s,  is  the  following  result:  for  any  polynomial  0m  €  FmI—Ij  1)? 

M 

||«jv0m||l2(-i,i)  <  c(l  +  ^)||0m1U2(_i,i). 

Here  no  duality  argument  allows  us  to  derive  from  the  previous  theorem 
improved  approximation  properties  in  negative  norms.  It  is  an  open  problem 
to  derive  such  results. 

The  numerical  analysis  of  problem  (13)  then  continues  by  noticing  that 

Y^fvjv  =  y]lN(f)vN, 

GL  GL 

which  is  one  of  the  ingredients  that  allows  to  prove  (see  [2]): 

Theorem  9.  Assume  that  the  solution  u  of  (12)  belongs  to  H^(fl),  that  the 
coefficients  in  A  are  very  regular,  and  that  the  data  f  belongs  to  Then 

the  solution  to  (13)  satisfies 

<  c(Ar^“^  11^11  ll/ll  j:fp(Q)). 

The  case  where  A  is  not  so  regular  can  be  handled  with  the  same  type 
of  arguments,  but  more  technical  tools  are  involved;  we  refer  to  [16]  for  more 
details.  It  is  interesting  also  to  note  at  this  level  that,  taking  into  account  non- 
homogeneous  Dirichlet  boundary  condition  is  very  simple  thanks  to  the  nice 
properties  of  the  interpolation  operator.  Indeed,  assume  that  the  solution  to 
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our  problem  (12)  has  to  satisfy  (instead  of  zero  Dirichlet  boundary  conditions) 
the  following  condition:  W|9n  =9  where  g  is  a  given  function  on  the  boundary 
of  fl.  Then,  naturally,  for  the  approximation,  we  look  for  a  polynomial  in 
P7v(f^)  such  that  (12)  holds,  and  in  addition 

UN\dn  =  ^n9, 

where  Xjv  is  the  operator  of  interpolation  defined  edge  by  edge  (respectively 
face  by  face)  from  (resp.  from  J^r).  Since  the  interpolation  operator  is 
optimal  both  in  and  in  H^,  it  results  by  an  argument  of  interpolation 
between  Sobolev  spaces  that  it  is  also  optimal  with  respect  to  the 
norm.  This  fractional  order  norm  is  the  natural  one  for  the  treatment  of  the 
boundary  terms.  It  has  also  to  be  stressed  that  neither  the  L^-projection 
operator  nor  the  if  ^-projection  operator  allow  such  an  optimality  nor  such 
ease  in  the  implementation. 

Next,  associated  with  the  Gauss  quadrature  formula,  we  can  also  define  an 
interpolation  operator,  denoted  as  jV  and  defined  as  follows:  G  C^([— 1, 1]), 

JnW  e  Piv+i(-l,  1)  and  Vz,  1  <  z  <  iV  +  1,  iiv(0)(CO  =  0(Ci). 

The  L^(— 1,  l)-approximation  properties  of  this  second  interpolation  operator 
are  also  optimal.  Unfortunately,  in  the  if  ^-norm  it  is  not  optimal;  for  instance 
it  is  readily  checked  that  ~  =  L^-i.  Recalling  that 

Vn.  = 

n[n  +  Ij 

it  is  then  easily  proven  that  j|Ljv+i  —  Tn-i|1//i(-i,i)  scales  like  while 

||L7v-i||//i{-i,i)  scales  like  0{N)\  jV  is  thus  not  stable  in  the  if^  norm. 

For  similar  reasons,  the  interpolation  operator  on  the  Gauss-Lobatto 
nodes  does  not  have  optimal  approximation  properties  in  the  if^(  — 1,  l)-norm. 
In  order  to  achieve  such  a  property,  we  have  to  refer  to  generalized  Gauss- 
Lobatto  rules  as  is  done  e.g.in  [1]. 

§4.  An  “Ideal”  Operator 

At  this  stage  there  is  no  operator  from  L^(-l,  1)  onto  the  set  of  polynomials 
that  has  optimal  approximation  properties  and  is  stable  both  in  the  and 
the  iif^  norm.  Such  an  operator  is  useful,  as  will  be  explained  below,  in  the 
analysis  of  the  Stokes  problem.  In  order  to  define  this  “ideal”  operator,  we 
fix  a  positive  real  number  X  and  a  cut-off  function  x  of  class  on  such 
that  X  is  equal  to  1  on  [0,1  —  A],  decreases  from  1  to  0  on  [1  ~  A,  1]  and 
vanishes  on  [l,oo].  Next,  with  each  positive  integer  A,  we  associate  as  in  [18] 
an  operator  with  values  inPA^(— l,l)n/fo(“l5l)  follows:  since  each 
function  (f)  in  1)  can  be  written  as  </>  =  <l>^{Ln+i  —  T„_i),  we  set 

TT^cj)  =  x(;^)</>”(-^'n+i  -  Tn-i).  Note  that  the  sum  above  is  finite  since 

X  has  a  bounded  support.  It  is  proven  in  [4]  that  this  operator  is  stable  both 
in  the  Hq  and  the  norms: 
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Theorem  10.  There  exists  a  constant  c,  independent  of  N  such  that  for  any 
function  <f)  €  iIo(-l,  1), 

||7r5^0l|L2(_i^i)  <  c1|(/)||l2(_i,i),  ||7r5^<^||/fi(-i,i)  <  ||01|hi(-i,i)-  (1®) 

It  is  an  easy  matter  to  verify  that  the  operator  leaves  invariant  all 
polynomials  of  Pajv(-1,  1)  H  1).  The  previous  stability  and  the  best 

fit  estimates  (9),  (10)  imply 

Theorem  11.  For  any  positive  real  number  r  and  any  real  number  0  <  s  <  r, 
there  exists  a  constant  c  >  0,  depending  only  on  r  and  s  such  that,  for  any 
function  (f>  G  1)  if  r  <  1  and  any  function  (j)  G  1)  H  1) 

if  r  >  1, 

\\(l)  -  n^(l>\\H^{-i,i)  < 

As  an  application  of  the  previous  result,  we  can  consider  the  problem 
of  finding  compatible  spaces  for  the  approximation  of  the  Stokes  equation. 
Under  variational  formulation,  this  problem  consists  in  finding  a  pair  (w,  p)  in 
X  Lo(n)  of  velocity  and  pressure  such  that 

f  VuVv-  f  pdivv  =  f  fv,  Vv  G 
Jci  Jn  Jn 

f  qdWu  =  0,  \fqeLl{n),  (18) 

Jn 

where  Z/o(^)  functions  with  zero  average.  It  is  well  understood 

now  that  the  spectral  approximation  of  the  Stokes  problem  based  on  polyno¬ 
mials  of  the  same  degree  leads  to  instabilities.  This  is  due  to  the  fact  that 
the  pressure  space  is  too  rich  in  comparison  to  the  velocity  space.  Indeed, 
there  exist  polynomials  q^  in  Piv((— Ij  1)^^)  such  that  Jj^g^vdivv^v  =  0  for  all 
VN  in  (Pjv((-l,l)‘^)nif^((-l,l)'^))‘^  (e.g.  qN{x,y,z)  =  LN{x)LN{y)LN(z)). 
Of  course  such  polynomials  (called  spurious  modes)  prevent  the  discrete  prob¬ 
lem  from  being  well-posed  since  it  prevents  the  definition  of  a  unique  pres¬ 
sure.  The  cure  is  well  known,  and  consists  in  depleting  the  pressure  space 
for  a  given  velocity  space.  In  [14]  the  pair  (Piv((— 1, 1)^^)  H  jEro((— 1, 1)*^))^  x 
Pw^-2((-l,  l)‘^)nTo{(“lj  1)^^)  proposed,  and  gets  rid  of  the  spurious 

modes.  It  is  known  as  the  Pat  x  Piv_2-method.  Actually,  what  is  looked  for 
is  a  pair  x  Mn  approximating  (ifo((-l,  1)'^))'^  x  Lo((-l,l)‘^)  well  and 
such  that  not  only  Vgjv  €  Mn,3vn,  f^pdivvjv  ^  0,  but  more  precisely,  in 
order  to  get  a  stable  method,  we  require  that 

Vgjv  6  AfiVjSwjv,  /  pdivuA^  >  /?|lviv||ffi((_i,i)<i)||aivllL2{(-i,i)d)5 

Jn 

where  /?  is  known  as  the  constant  of  the  inf-sup  condition.  The  behaviour 
of  jS  for  the  Pat  x  PAr-2”niethod  scales  as  0{N  2“)  (see  [2]),  and  it  has 

been  a  long  standing  question  whether  there  is  a  uniformly  stable  spectral 
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approximation  of  the  Stokes  problem.  It  has  to  be  said  that  the  nonuniform 
behaviour  of  the  inf-sup  constant  pollutes  the  accuracy  of  the  pressure,  but 
also  pollutes  the  convergence  properties  of  some  classical  solvers  for  the  Stokes 
problem  (see  [13]).  The  “ideal”  operator  introduced  above  allows  us  to  prove 
that  a  compatible  choice  is  the  Pat  x  P^iv-niethod  that  proposes,  for  the 
same  choice  of  velocity  space,  Pa7v((-1>  1)^)  nL§{(— 1,  l)'^)  to  be  the  pressure 
space.  The  following  result  is  due  to  Bernardi  and  Maday  [4] : 

Theorem  12.  For  any  real  number  A,  0  <  A  <  I,  there  exists  a  positive 
constant  P  independent  of  N  such  that,  for  any  integer  N  >  2/(1  -  A)  and 
any  qj,  G  PAiv((-l,  1)")  H  L§((-1,  l)^'), 


In 


Proof;  Let  be  any  polynomial  in  Pa7v((-1,  l)'^)  H  Lo((-l,  l)*^).  It  is  a 
standard  matter  (see  e.g.  Corollary  2.4  in  [8])  that,  to  qN^  can  be  associated 
a  (continuous)  element  v  in  [i/o((~l>  l)*^)]"^  such  that 


divv  =  qn  and  ||v||//i((_i,i)d)  <  c||giv||L2({-i,i)d)- 

The  problem  is  that  v  is  not  a  polynomial.  We  define  v/v  =  7^%  ®  in  2D 
and  V N  =  TT^  ®  TT^  TT^v  in  SB  for  which  we  derive  thanks  to  (13)  that 

Due  to  the  fact  that  leaves  invariant  all  polynomials  of  Pan(— 1?1)  Cl 
//■qC— Ij  l)j  we  deduce  that  qj\/div(v/\j^  -  v)  =  0,  and  thus 

/  qjvdivv/v  =  /  qwdivv  =  q%, 

Jn  Jn  Jn 

which  concludes  the  proof  with  /?  =  A  □ 

§5.  Extension  to  Domain  Decompositions 

In  the  spectral  method  history,  the  need  to  tackle  more  general  domains  was 
recognized  early.  In  this  direction.  Patera  has  proposed  in  [19]  the  spectral 
element  method  that  combines  the  accuracy  of  the  spectral  method  with  the 
flexibility  of  the  domain  decomposition  methods.  The  idea  is  to  introduce  a 
partition  of  the  domain  D  as  a  union  of  nonoverlapping  subdomains; 

n=uf=in‘,  = 

In  addition,  we  assume  that  each  subdomain  is  associated  with  a  regular 
one-to-one  mapping  that  maps  the  brick  (—1,1)^  onto  and,  for  the 
time  being  at  least,  we  make  the  following  assumptions: 


Review  of  Spectral  Operators 


321 


Assumption  1. 

'  an  entire  common  face  (in  3D),  or 
an  entire  common  edge,  or 

a  common  vertex,  or 

u 

Assumption  2.  The  two  parametrizations  of  the  previous  intersection 
0  n  n  ,  resulting  from  and  coincide. 

This  allows  us  to  define  the  discrete  space 

Xn  =  {VN  e  Hlin), o e  Pjv((-1,  !)■')} 

and  the  discrete  associated  problem  (2)  (or  its  implement  able  version  involving 
the  Gauss-Lobatto  quadrature  rule  over  each  as  in  (13)). 

The  main  ingredient  that  allows  us  to  prove  that  the  previous  scheme  is 
again  optimal  lies  in  the  definition  of  a  element  in  that  approximates  well 
a  given  regular  function  u.  This  is  done  easily  by  considering  the  element  ujv, 
defined  locally  over  each  subdomain  as  =  TnIu^qh  o^T*].  It  results 

from  Assumptions  1  and  2  that  is  actually  continuous  and  vanishes  over 
Oft.  Prom  (15)  it  is  an  optimal  approximation  of  u  in  the  sense  that 

Ik  “  (19) 

The  best  fit  in  is  certainly  as  good  as  the  proposed  v^,  and  the  spectral 

element  method  can  be  proven  to  be  an  optimal  approximation.  We  have  only 
sketched  the  numerical  analysis  of  this  approximation,  since  the  main  purpose 
of  this  paper  is  to  discuss  projection  operators.  It  is  fundamental  to  have  used 
here  the  interpolation  operator  to  construct  Ujv,  since  it  provides  a  globally 
continuous  function.  As  an  example,  the  use  of  the  iiT^-projection  operator 
would  not  have  given  rise  to  a  continuous  function  since,  for  a  given  function 
(j)  over  the  brick  (—1, 1)^^,  the  value  of  n]y}^((/))  over  any  face  depends  not  only 
on  the  value  of  (f)  on  the  given  face,  but  depends  on  4>  inside  the  whole  domain. 

We  want  to  end  this  section  by  giving  some  hints  on  the  “mortar  spectral 
element  method”  due  to  Bernard!,  Maday  and  Patera,  that  allows  to  relax 
assumptions  1  and  2  (and  even,  more  generally,  allows  to  combine  spectral 
methods  on  some  subdomains  with  different  finite  element  methods  on  others 
see  [5]).  Due  to  lack  of  space,  but  also  in  order  to  better  understand  the 
main  feature  of  the  projection  operators  that  is  at  the  basis  of  the  method, 
we  shall  consider  a  simple  two  dimensional  domain  0  =  (—1,2)  X  (-1.1) 
decomposed  into  3  subdomains  =  (—1,1)  x  (-1,1),  =  (1,2)  x  (—1,0) 

and  =  (lj2)  x  (0,1).  This  decomposition  violates  assumption  1  since  the 
_ 1  _ 2 

intersection  O  D  is  not  a  common  whole  edge.  We  want  nevertheless  to 
propose  a  discrete  method  that  will  allow  to  provide  an  optimal  approximation 
of  the  solution  u  of  (12)  (with  A  =Id  for  the  sake  of  simplicity).  The  discrete 
space  X^  that  we  propose  is  imbedded  in 

Vn  =  {m  €  L^(n), Ujv|fi*  6  IPw. =  0,  Wiv|fi2  =  over  n 


fj*"  n  a'  = 
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but  it  is  readily  checked  that  imposing  continuity  at  the  level  of  the  inter¬ 
face  X  =  1  will  rigidify  the  approximation  and,  in  the  general  case,  will  spoil 
the  accuracy  of  the  method.  In  order  to  relax  this  continuity  condition  (re¬ 
mind  that  it  is  inherited  from  the  requirement  that  X’^  C  X),  we  resort  to 
nonconforming  approximations.  We  shall  replace  the  continuity  condition  by 
requiring  that,  over  the  interface  2;  =  1,  we  impose  for  each  element  in  y}v 

J  [v~{l,y)  -  v'^{l,y)]'iljN{y)dy  =  0,  G  Fiv-2(-l,  1),  (20) 


where  v  =  and 


f  for  (a:,  T/)  G 

\v\^3  for  (x,?/)  G 


Since  t;“(l,2/)  has  to  vanish  for  y  =  ±l  (due  to  the  homogeneous  boundary 
conditions),  it  is  entirely  defined  by  the  -  1  conditions  in  (20);  in  particular 
choosing  ij)  in  Piv(— 1,1)  would  be  much  too  stringent.  The  elements  of  Y}v 
that  satisfy  (20)  constitute  the  space  X^  of  approximation.  The  method  is 
then:  find  u*^  G  X^  such  that 


K  K 

aN{u%,VM)  =  y2  f  Vu’i,VvN  =  Tf  fvN,  ^VN^Xl,.  (21) 


Since  is  no  longer  a  subspace  of  X,  the  ellipticity  of  the  bilinear  form  of 
this  problem  is  not  straightforward.  Nevertheless,  it  is  true  (and  here  it  is 
particularly  obvious  since  fl  dU.  ^  0).  This  argument  allows  us  to  check 
that  there  exists  a  unique  solution  to  (21).  In  order  to  derive  the  error 
bound  we  proceed  as  follows:  for  any  G 

=  yz  -  '^n)  -  yz  /  Vw;yvV(wJ^  -  wn) 

K  K 

=  y^l  -Ari(wJ/ -  Wjv)  -  /  Xw!^V{u*^- 

K  K  , 

=  VuV{u*^  -  wn)  -  52  /  Vwn^{u*^  -  wn) 


■  WN) 


-I 


/ 

1  -  i'^N  - 


SO  that,  from  (20)  we  derive  that  for  any  ip  G  P/v-2(“l,  1) 


K 


allitJV  -  wnIIJ  <  52  /  v(u  - 'u;yv)V(?/J^  -  wyv) 

f  du 
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It  follows  from  the  previous  inequality  that 


n  ,1  ^  II  II  ,  /oo^ 

a||n^  -  Wivll*  <  cl|u-u;iv||*  +  sup  ^ - r — rj - .  (22) 

VWGX*  Pn\\* 


By  choosing  ifj  equal  to  it  results  that 

/x=i[5i  -  -  vji]dy 


sup 


Ikivll* 


<CN  ^l|ul|/fr(f2). 


It  remains  to  choose  a  good  approximation  wjv  of  u  in  to  take  into  account 
the  first  term  on  the  right  hand  side  of  (22).  This  is  done  by  noticing  that,  for 
any  (j)  G  1),  the  element  (pN  of  P^vC-l,  1)  H  1)  that  satisfies 


fj 


[(pN  -  (p]'^N(y)dy  =  0,  'i'lpN  €  Fiv-2(-l5  l)j 


is  nothing  other  than  (pN  =  '^]fi(p)-  Indeed,  we  remark  that,  for  any  xn  ^ 
PAr(-l,  1)  n  1),  then  Xn  ^  Pn-2(”1,  1),  thus 

j  [(pN  -  (p]xN{y)dy  =  -  J  -  (PYxNiy)dy- 

The  choice  of  a  good  element  is  done  as  follows.  We  first  set  — 

In{un\ci'<^)  that  is  an  element  of  Y^.  We  then  set  for  k  =  2,3, 

and  build  wn\qi  by  adding  to  wff  the  correction  7rjy^(u;]y  —  'w]^){y) 
so  that  it  satisfies  (20).  Due  to  the  optimal  approximation  properties  of  the 
operator  both  in  the  and  in  the  /f^-norms,  we  deduce  that  the  mortar 
spectral  approximation  (21)  is  optimal  in  the  sense  that  (19)  still  holds. 
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Abstract.  The  H-basis  concept  allows  an  investigation  of  multivariate 
polynomial  spaces  degree  by  degree.  In  this  paper,  we  mention  its  con¬ 
nection  to  the  Grobner  basis  concept,  characterize  H-bases,  show  how  to 
construct  them,  and  present  a  procedure  for  simplifying  polynomials  to 
their  normal  forms.  Applications  will  be  given  in  [8]. 


§1.  Introduction 

We  consider  11,  the  ring  of  polynomials  in  cci, . . . ,  with  coefficients  from  an 
infinite  field  K,  i.e.  11  =  K[rci, . . .  ,x„],  and  the  subsets  11^  of  all  polynomials 
of  degree  at  most  d.  In  many  applications,  one  is  interested  in  getting  a  basis 
or  a  generating  set  for  the  linear  vector  space  /  fl  11^,  where  /  C  n  is  an 
ideal.  Having  an  H-basis  {/i, .  •  • ,  /s}  for  /,  then  the  set  of  all  pi  •  fi  with 
Pi  e  Ud-degifi)^  i  =  1, . . . ,  s,  generates  /  H  H^  as  a  linear  vector  space.  Thus 
the  H-basis  concept  is  a  tool  for  transforming  a  non-linear  problem  in  H  into 
a  problem  in  one  (or  in  a  series  of)  finite  dimensional  linear  space(s)  H^. 

H-bases  were  introduced  first  by  Macaulay  [4].  His  original  motivation 
was  the  transformation  of  systems  of  polynomial  equations  into  simpler  ones. 
The  power  of  this  concept  was  not  really  understood,  presumably  because 
of  the  lack  of  facilities  for  symbolic  computations.  When  Computer  Algebra 
Systems  came  up,  Grobner  bases  (G-bases  for  short)  were  used  instead  of 
H-bases.  These  bases,  originally  invented  by  Buchberger  [2]  for  computing 
multiplication  tables  for  factor  rings,  are  now  also  applied  for  simplifying 
some  problems  in  Numerical  Analysis,  see  [5]. 

The  G-bases  give  generating  systems  not  to  J  fl  H^  but  to  /  D  .F* ,  where 
Ft  C  H  is  a  linear  vector  space  of  dimension  z,  and  Fj  C  Fi+i  for  all  i  and 
H  =  Ui>oFi.  This  finer  decomposition  has  some  drawbacks.  For  instance  if 
an  ideal  is  invariant  under  an  affine  symmetry  group,  its  G-bases  are  typically 
not  invariant.  Since  the  spaces  Hd  are  invariant  under  affine  symmetry  groups, 
H-bases  do  not  destroy  such  symmetries. 

Many  of  the  problems  in  applications  which  can  be  solved  by  Grobner 
techniques  can  also  be  treated  successfully  with  H-bases.  In  [7]  we  gave  an 
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overview  of  such  problems.  In  the  present  paper,  we  describe  briefly  the 
underlying  concept  of  grading  rings,  which  leads  to  G-  and  H-bases,  and 
present  some  properties  characterizing  H-bases.  In  contrast  to  [7],  where  we 
only  gave  a  class  of  examples  of  H-bases,  we  present  here  the  construction  of 
an  H-basis  for  zero-dimensional  ideals  /.  A  useful  tool  for  our  procedure  is 
the  so  called  normal  form  mapping  NF,  presented  in  Section  4,  which  projects 
n  orthogonally  to  the  ideal  /  provided  an  H-basis  of  I  is  given.  In  [8]  we  show 
how  these  normal  forms  can  be  applied  in  numerical  applications, 

§2.  H-bases  and  G-bases 

In  ring  theory,  rings  can  be  graded  by  an  ordered  monoid,  i.e.  by  an  abelian 
semigroup  F  with  addition  -f-  and  total  ordering  -<  satisfying 

7i  ^  72  =>  70  +  7i  ^  7o  +  72,  V70, 7i, 72  ^  T . 

There  are  two  major  examples  for  grading  H  by  an  ordered  monoid  F : 

n  =  0n(n,  ntnntn  c  V7i,72  e  r. 

The  first  one  is  the  H-grading  with  F  :=  INo, 

:=  {p  6  n  \  p  homogeneous  of  order  7}. 

The  ordering  of  F  =  No  is  the  natural  one.  The  second  example  for  gradings 
is  the  G-grading,  where  F  :=  IN”  and 

nhL7„)-=  |ceK}. 

F  =  IN”  is  ordered  by  an  admissible  term  ordering, 

0  i-<j=^i  +  k^j  +  k. 

Since  the  decomposition  of  H  into  the  sets  H^^^  is  a  direct  sum,  every  /  G  H 
has  a  unique  representation  f  =  Ylfi-  The  maximal  7  with  /  0  is  called 
the  maximal  part  of  /  ^  0,  M^'^\f)  for  short.  It  is  also  called  the  maximal  form 
in  the  H-case,  or  leading  monomial  in  the  G-case.  In  the  G-case,  MT)(/)  = 
lc{f)lt{f),  where  lc{f)  6  K  is  the  leading  coefficient  and  /£(/)  = 
the  leading  term.  The  maximal  form  of  /  7^  0  is  also  denoted  by  Mnif)- 

Definition  1.  {pi, . . .  C  I  is  called  a  basis  of  an  ideal  /  C  H,  briefly 
I  =  ifV  pel 


3pi,...,Pm  G  n  :  p  =  '^PkPk- 

k=\ 
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It  is  also  a  G-basis  or  H- basis  resp.  if  gi, ...  ,gjn  satisfy  in  addition 
max  lt{gk)lt{pk)  =  lt{p)  (G  -  basis), 

k=l 


or  max  deg{gkPk)  =  deg(p)  {H  —  basis). 
k=l 

Theorem  1.  Let  I  =  {pi,...,pm)-  Then  {pi,...,PTn}  is  an  H-basis  (G- 
basis  resp.)  if  and  only  if  the  least  ideal  containing  all  Mff(f),  0  ^  /  G  / 
(or  all  lt{f),  0  /  G  /  resp.)  is  generated  by  Mh(pi),  ...,  MniPm)  (or  by 

lt{pi), . . .  ,lt{pm)  resp.). 

This  theorem,  which  holds  mutatis  mutandis  for  arbitrary  graded  rings, 
is  proved  for  instance  in  [6].  An  immediate  consequence  of  it  is  that  every 
ideal  I  ^  (0)  has  an  H-  and  a  G-basis. 

G-bases  are  now  a  standard  tool  in  Computer  Algebra.  They  are  covered 
by  nearly  all  textbooks,  and  are  contained  in  almost  all  Computer  Algebra 
Systems.  The  grading  by  one-dimensional  linear  spaces  often  reduces  the 
computation  to  solving  a  series  of  one-dimensional  problems.  On  the  other 
hand,  the  construction  of  G-bases  is  often  difficult  or  even  impossible  because 
of  the  high  complexity  of  Buchberger’s  algorithm  for  computing  G-bases.  In 
addition,  in  many  applications  the  G-bases  allows  only  little  insight  into  the 
structure  of  a  solution  by  the  artificial  ordering  term  by  term. 

§3.  Characterization  of  H-bases  and  Normal  Forms 

Macaulay  introduced  H-bases  using  homogenizations  and  dehomogenizations 
of  polynomials.  The  name  H-basis  originates  from  the  first  letter  of  homoge¬ 
nization. 

Definition  2.  Let  f  G  lK[xi, . . , ,  Xn]  have  degree  d, 

d 

f  =  ^/i,  fi  homogeneous  of  degree  i,  fd  0. 
i=0 

Then  introducing  a  new  variable  xq,  the  homogenization  of  f  is  a  homogeneous 
degree  d  polynomial  in  K[xo,  xi, ... ,  Xn], 

m 

A  homogeneous  F  G  ]K[a:o,  Xi,,  ..,Xn]  can  be  dehomogenized  to  an  f  by 
a:o  =  1- 

For  more  details  on  homogenizations  and  their  connection  to  projective 
coordinates,  we  refer  to  [3]. 
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Theorem  2.  (Macaulay  [4]^.  Let  I  =  {hi, ...  ,hs).  Then  the  following  state¬ 
ments  are  equivalent 

1)  The  least  ideal  containing  all  ^{h),  0  h  e  I,  is  . .. ,  ^{hg)). 

2)  xoF  e  mhi), ..., ^hs))  F  e  mhi), ..., ^hg)). 

3)  {hi ,...  ,hs}  is  an  H-basis  of  I. 

The  power  of  the  G-basis  concept  is  mainly  based  on  the  possibility  of 
reducing  a  polynomial  to  a  simpler  one  by  subtracting  suitable  multiples  of 
elements  of  the  G-basis.  A  consequent  application  of  this  reduction  strategy 
gives  the  so  called  normal  form,  in  a  sense  the  simplest  polynomial  obtainable 
by  the  reductions.  We  translated  this  technique  to  H-bases  in  [7],  and  give 
here  for  consistency  a  short  resume  of  the  main  results. 

Definition  3.  We  denote  by  the  space  of  all  homogeneous  degree  d 

polynomials  for  d  E  INo  Si^d  :=  {0}  for  d  <  0.  Let  hi,. . .  ,hg  E  11.  Then 
we  define  a  finite  dimensional  linear  subspace  of  11^  ^  by 

s 

V(i(hi, . . . ,  hg)  :=  {Y^  giMnihi)  |  Qi  E 
1=1 


Analogously  for  an  ideal  I  CU, 

Vd{I)  :=  {Mh{p)  \pe  I,  deg(p)  =  d}  U  {0}. 

We  introduce  an  inner  product  (  .  ,  .)  in  11,  for  instance,  by  the  inner 
product  of  the  (weighted)  coefficient  vectors,  or  by  a  strictly  positive  linear 
functional  J  and  (f,g)  '=  J{f  •  g)  if  K  C  H  or  :=  J{fg)  if  K  =  C.  Then  we 
can  define  orthogonal  complements  Wd{hi,. . . ,  fis)  and  Wd{I)  in  11^  .  Hence 

Vi{hu e  w^hi,. ..,h,)  =  nf  >  and  V41) ffi  Wj{i)  = 

Let  us  consider  a  polynomial  /  of  degree  d.  Then 

Mnif)  e  Vd{hi,  ...,hg)®  Wdihi,. . . ,  hg). 

Let  Wd  denote  its  natural  projection  on  Wd(hi, . . .  ,hs).  This  homogeneous 
polynomial  can  be  computed  by  solving  a  finite  linear  system  of  equations  be¬ 
cause  has  a  finite  dimension.  Hence  there  are  homogeneous  polynomials 
gi, ...  ,gg  such  that 

s 

f  =  wd  d-^gihi  +  fi,  gi  E  h  ^  n^-i. 

1=1 


We  say  /  reduces  to  WdAfi  modulo  {hi, ... ,  hg]  and  call  fi  then  the  remainder 

off. 
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In  the  reduction  modulo  {hi, . . . ,  hg}  the  degree  of  the  remainder  /i  is  less 
than  deg{f).  Hence  this  reduction  can  be  applied  recursively  reducing  /i_i 
constructively  to  Wd+i-i  +  fi  modulo  {hi, . . . ,  h^}  for  i  =  1, . . . ,  d  + 1  starting 
with  /  =  /o  ^  Hd  and  terminating  with  fd+i  =  0,  because  the  constant  fd 
is  either  in  Vo{hi,.  ..,hs)  or  in  Wo(^i,  •  •  •  ?  K)-  Combining  these  reductions 
modulo  {hi, . . . ,  hs},  one  obtains  for  / 

d  d  s 

f  =  '^Wi-\-  gij  e 

t=0  r=0  j=l 

Then  YA=o'^i  uniquely  determined  by  /,  by  {hi, . . .  ,hs},  and  by  the  un¬ 
derlying  inner  product. 

Definition  4.  Let  hi,...,hs  €  H.  We  say  f  e  Ud  reduces  fully  modulo 
{hi,...,hs}  to  if  every  Wi  G  H^i(hi, . . . ,  hg)  is  constructed  as  de¬ 
scribed  above.  called  the  normal  form  of  /  modulo  {hi,...,hs}, 

for  short 

d 

NF(f,{ki,...,ks}) 

j=o 

If  {hi, . . . ,  hs}  is  not  an  H-basis  of  /  :=  (hi, ... ,  h^),  then  Mnif)  is  not 
necessarily  contained  in  I4eg(/)(hi, . . . ,  h^,),  although  /  G  /.  This  means,  that 
eventually  the  first  homogeneous  polynomial  Wd  is  not  0  if  /  G  /.  Hence  at 
most  if  {hi, . . . ,  hg}  is  an  H-basis,  then  NF(/,  {hi, . . . ,  hs})  =  0.  In  fact,  as 
quoted  in  [7]  but  shown  already  in  [9],  {hi, . . .  ,hs}  is  an  H-basis  if  and  only 
if  NF(/,  {hi,...,hs})  =  0  for  every  f  G  {hi,...,hs). 

Another  characterization  of  H-bases  given  in  [7]  is  as  follows. 

Theorem  3.  Let  I  be  an  ideal  and  hi, . . . ,  hs  G  /.  {hi, ...,  hg}  is  an  H-basis 
of  I  if,  and  only  if,  for  all  d  e  IN, 

VdiI)^Vd(hi,...,hg). 


§4.  On  the  Construction  of  H— bases 

Macaulay  proposed  in  [4]  a  procedure  for  computing  H-bases  of  ideals  given  by 
a  basis.  However,  his  description  was  only  by  an  example.  He  claimed  “  This 
procedure  is  a  general  one”.  But  he  needs  in  his  example  the  computation  of 
certain  modules  of  syzygies.  These  can  be  constructed  only  in  special  cases 
or  by  computing  first  a  G-basis  and  then  applying  techniques  as  in  [1]. 

On  the  other  hand,  if  an  admissible  term  ordering  is  compatible  with 
degrees, 

degixj^ <deg{x^^ xj^  -  •  xl^  ^  x^^  •  ■  ■  x^^ , 

then  a  G-basis  with  respect  to  is  also  an  H-basis.  Hence  Buchberger’s 
algorithm  for  computing  G-bases  also  serves  for  computing  H-bases.  This 
seems  a  more  direct  access  than  via  syzygies.  However,  if  one  wants  to  use 
H-bases  instead  of  G-bases,  this  way  is  still  a  detour. 
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In  case  the  number  n  of  variables  coincides  with  the  number  of  given 
polynomials,  then  there  is  an  easy  test  for  H-bases  as  proved  in  [7]. 

Theorem  4.  Let  hi,.. .  ,hn  ben  polynomials  such  that  their  maximal  forms 
,  Mff(hn)  have  only  the  point  (0, . . . ,  0)  as  common  zero.  Then 
{hi, . . .  ,hn}  is  an  H-basis. 

For  an  arbitrary  zero-dimensional  ideal,  i.e.  for  an  ideal  I  such  that  the 
polynomials  in  /  (  equivalently:  the  polynomials  in  an  arbitrary  basis  of  /) 
have  only  a  finite  number  of  common  zeros  in  K  ,  K  the  algebraic  closure  of 
K,  see  [3],  we  present  here  a  procedure  which  computes  an  H-basis  from  a 
given  basis. 

Procedure  for  computing  H-bases. 

In  :  Hoi  ^  finite  polynomial  set  generating  a  zero-dimensional  ideal  I. 
Out  :  H,  an  H-basis  of  I. 

Start:  H  :=  Ho,  d  =  0. 

Loop:  Check  the  finite  dimensional  linear  vector  space 

Vd{n)  :=  gnMnih)  \  gnMnih)  6  nf  >} 

hen 

for  linear  dependencies.  If  Ylhen  ~  compute  p  :— 

^^{Ylhen  9hh,H)).  If  p  7^  0,  then  enlarge  H  by  p,  and  modify  con¬ 
sequently  Vo{H), . . .  ,Vd^i{H).  Lower  d  to  the  least  k  where  Vk{H)  is 
changed  and  go  to  Loop.  If  for  no  linear  dependency  such  p  is  nonzero, 
then  then  enlarge  d  by  1.  If  now  Vd{H)  =  H^^^  holds  true,  then  return 
H  otherwise  go  to  Loop. 

This  informal  description  can  be  extended  easily  to  a  correct  algorithm.  One 
has  to  observe  that  the  checking  of  Vd{H)  for  linear  dependencies  needs  a  basis 
of  the  nullspace 

s 

{(^1>  •  ■  •  ^  ^  ^  ^i-de5(h,)  I  =  ^]  i 

i=l 

where  H  =  {hi, . . .  ,hs}.  If  for  every  basis  element  {gi,. . .  ,gs)  the  normal 
form  of  holds  for  every  element  of  the  nullspace,  i.e. 

for  every  dependency.  As  a  byproduct  of  the  basis  computation  one  obtains 
dimVd{H).  Then  the  test  Vd{H)  —  H^'^^  reduces  to  a  comparison  of  the 

/  rt\ 

dimensions  because  of  Vd{H)  Q  H^  . 

For  proving  correctness  and  termination,  we  consider  first  /  :=  Ylhen  9^^ 

with  ghMnih)  e  hJ/^^  for  all  heH.  If  NF(/,?7)  =  0,  then  especially  M//(/)  € 
Vk{H)  for  a  /c  <  d,  and  hence 

Mnif)  e  {MH{hi),...,MH(hs))i  where  H  =  {/ii, . . . , /i^}- 
In  case  p  :=  NF(/,7^)  7^  0  either  Mnif)  Mh{p)  holds,  i.e.  again 
MH{f)e{MH{hi),...,MH(hs)), 
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or  Mnif)  =  Mh(p)  holds,  i.e. 

MhU)  e  (Mh{/m),  . . . .  MH{h.),  Mh(p)). 

Therefore,  if  in  the  procedure  d  is  increased  by  1  (and  7i  is  updated),  then  for 
every  0  ^  9hh  G  /,  deg{gh)  +  deg{h)  <  d  the  relation 

Mh{Y,  9hh)  6  {Mnihi), M„{h,)) 
hen 

holds  where  again  U  =:  {hi, . . . ,  hj.  This  is  our  inductive  hypothesis. 

The  ideal  I  has  an  H-basis,  say  {<^i,  is  a  basis  of  I.  Hence 

every  has  a  representation  g)i  =  Y^j-i9ijhj,  gij  G  H.  If  the  inductive 
hypothesis  holds  for  d,  then  one  obtains  at  least  for  d  >  M  :=maiXijdeg{gijhj) 
that 

MH{(pi)  e  {MH{hi),...,MH{hs)),  2  = 

Hence  for  those  d  Vd(v?i,  •  • .  ,¥?m)  Q  VdiH).  But  Vd{(pi,  ■  ■  ^  ,<fm)  = 
since  {<^i, . . .  is  an  H-basis  of  1.  Therefore  VdiH)  =  Vd{I)  for  d>  M. 
By  the  inductive  hypothesis,  also  VkiTi)  =  Vk{I)  holds  for  fc  <  M.  Hence  Ti 
is  an  H-basis  of  I  if  we  arrived  at  a  d  >  M  in  the  procedure. 

The  ideal  I  has  dimension  0.  Then  there  is  a  D  such  that  Vd{I)  =  H^ 
for  all  d  >  D,  see  for  instance  [3,  Ch  9.4,Prop.6]  and  [3,  Ch  5.3,Thm.6]. 
Hence  for  d  >  max{D,  M]  one  has  VdiTi)  =  Thus  in  the  course  of  the 

procedure,  one  arrives  once,  not  knowing  M,  at  a  do  with  VdoiTC)  =  H^^  . 
Then  also  Vk(7i)  =  H^^^  for  all  k  >  do.  Therefore,  for  every  polynomial  f  el 
with 

f  =  9i  ^  ^i-deg{hi) 

i-1 

the  assumption  Mnif)  ^  (^/f(hi), . . . ,  Mnihs))  leads  to  deg{MH(f))  <  do- 
But  then  the  inductive  hypothesis  gives  a  contradiction.  Therefore  the  proce¬ 
dure  gives  no  new  enlarging  the  set  H.  This  ensures  termination. 

An  implementation  of  an  algorithm  based  on  this  procedure  and  a  com¬ 
plexity  analysis  is  still  a  work  under  progress. 
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H-Bases  II:  Applications 
to  Numerical  Problems 


H.  Michael  Moller  and  Thomas  Sauer 


Abstract.  We  show  how  H-bases  can  be  applied  to  polynomial  interpo¬ 
lation  and  for  the  solution  of  systems  of  nonlinear  equations.  We  will  give 
an  example  of  a  system  of  polynomial  equations  where  the  H-basis  leads 
to  more  stable  computations  than  with  the  Grobner  basis. 


§1.  Introduction 

In  the  preceding  paper  [12],  we  introduced  the  notion  of  H-bases  for  poly¬ 
nomial  ideals,  and  showed  how  to  construct  H-bases  in  the  numerically  most 
interesting  case  of  a  zero  dimensional  ideal.  In  this  paper  we  consider  two  prob¬ 
lems  from  Numerical  Analysis,  namely  polynomial  interpolation  and  solving 
systems  of  polynomial  equations,  and  point  out  how  H-bases  can  be  applied 
to  both.  More  precisely,  in  both  cases  the  computation  of  normal  forms  with 
respect  to  an  ideal  plays  a  crucial  role,  and  with  the  basic  results  from  [12] 
available,  H-bases  yield  a  perfect  replacement  for  the  Grobner  bases  which  are 
normally  and  frequently  used  to  do  this  job  [8].  Finally,  we  will  consider  an 
example  where  a  properly  chosen  H-basis  leads  to  a  significant  stabilization 
of  the  computations  in  comparison  with  the  use  of  Grobner  bases. 


§2.  Interpolation 

A  finite  set  0  C  H'  of  linearly  independent  functionals  on  H  is  said  to  define 
an  ideal  interpolation  scheme  if  its  kernel,  kerO  C  H,  is  an  ideal  in  H.  Given 
an  ideal  interpolation  scheme  0  and  a  polynomial  /  G  H,  the  interpolation 
problem  consists  of  finding  p  G  H  such  that 

0(p)  =  0(/),  i.e.,  'd{p)  =  'd(f),  t?  G  0.  (1) 
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So  far,  we  have  put  no  restrictions  on  p;  hence,  there  are  infinitely  many 
solutions  to  (1).  More  precisely,  if  p  is  any  solution  of  (1),  hence  /  -  p  G  ker  0, 
then  the  set  of  all  solutions  is  the  equivalence  class 

[p]  =  p  -I-  ker  0  =  /  +  ker  0  [/] . 

We  denote  the  linear  space  of  all  equivalence  classes  by  11/  ker  0,  and  remark 
that  (dim  n/ ker  0)  =  #0.  Of  course,  in  order  to  compute  interpolation  poly¬ 
nomials,  we  must  find  a  way  to  choose  a  specific  element  from  the  equivalence 
class  [/].  A  “natural”  choice  is  to  take  the  normal  form  NF  (/,  Tf),  where  Ti 
is  an  H-basis  for  ker0.  Since  [/]  =  [g]  implies  that  f  —  g  ^  {H),  and  since 
NF  (-,7^)  is  a  linear  operator,  we  have  that 

[/l  =  b]  NF  (/,«)  =  NF  (3.  W)  +  NF  (/-ff.W)  =  NF  (p.H). 

=0 

Hence,  NF  ([/]  ,7i)  =  NF  (/,  W),  that  is,  the  normal  form  is  the  same  for  any 
element  of  the  same  equivalence  class.  This  algebraic  approach  also  allows 
for  interpolation  of  functionals  which  are  only  given  implicitly,  that  is,  by  an 
ideal  T  C  H:  compute  an  H-basis  H  for  X  and  the  interpolation  operator  is  the 
“remainder  of  division”  NF  ('■,'H).  It  is  worthwhile  to  remark  that  one  of  the 
oldest  papers  on  multivariate  interpolation,  namely  [6],  starts  with  implicitly 
given  interpolation  nodes. 

Another  approach  is  to  look  for  a  polynomial  space  V  CTl  which  allows 
for  unique  interpolation  with  respect  to  0;  to  restrict  the  number  of  solutions 
to  this  problem,  one  usually  demands  the  interpolation  operator  L'p  :  H  — >  "P 
to  be  degree  reducing  [3],  that  is, 

degLp/<deg/,  /  G  H. 

Such  an  interpolation  space  with  a  degree  reducing  interpolation  operator 
is  called  a  minimal  degree  interpolation  space.  The  most  prominent  minimal 
degree  interpolation  spaces  is  the  least  interpolation  space  introduced  by  de 
Boor  et  al  in  [2],  and  is  the  unique  degree  reducing  interpolation  space  which 
satisfies  the  additional  condition 

^Gker© 

On  the  other  hand,  it  is  obvious  that  the  operator  NF  (-,  7f)  is  degree  reducing, 
linear  and  interpolating,  hence  all  the  spaces  P  =  NF  (H,?^),  for  any  H- 
basis  are  minimal  degree  interpolation  spaces  with  interpolation  operator 
Lp  =  NF  (•,  ?Y).  Moreover,  it  is  even  possible  to  recover  known  minimal  degree 
interpolation  spaces  by  this  algebraic  process. 

Theorem  1.  [15]  The  least  interpolation  space  is  given  as  NF  where 

H  is  an  orthogonal  H-basis  with  respect  to  the  inner-product 


(p,q)  =  (p(D)  q)(0},  G  n. 
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§3.  Polynomial  System  Solving 

Probably  the  best-known  and  most  frequent  use  of  Grobner  bases  is  for  solv¬ 
ing  polynomial  systems  of  equations,  where  they  form  a  core  part  of  literally 
all  available  computer  algebra  systems.  These  systems  of  equations  arise  natu¬ 
rally  in  a  geometric  context,  such  as  finding  solutions  of  geometric  constraints 
(for  example,  any  Euclidean  distance  constraint  yields  a  quadratic  equation) 
or  “simply”  computing  the  intersection  of  algebraic  curves/surfaces  given  in 
implicit  form.  So,  given  any  finite  set  .7^  G  11  one  wants  to  find  the  associated 
algebraic  variety  X  G  IK  (some  algebraic  closure  of  our  underlying  field  K) 
such  that 

T{X)  =  0,  (2) 


that  is. 


f{x)  =  0,  xex,feT. 


Note  that  the  emphasis  here  is  not  on  finding  one  solution  (which  could,  at 
least  in  the  case  that  =  n,  be  done  by  a  Newton  method),  but  on  finding 
all  solutions  and  obtaining  structural  information  about  the  variety.  It  is  easy 
to  see  that  the  variety  is  not  a  property  of  the  specific  set  .F,  but  of  the  ideal 
{/•>: 

J^(X)  =  0  ^  {r)  (X)  =  0. 


Therefore,  it  may  be  helpful  to  find  particular  bases  for  (F)  which  allow  for 
an  efficient  solution  of  (2).  The  “classical”  implementation  in  most  Computer 
Algebra  systems  relies  on  the  computation  of  elimination  ideals,  which  means 
the  computation  of  a  basis  for  the  subideals 


(F)fc  =  (F>nK[a;i,...,Xfc],  fc  =  l,...,n. 


where  {F)„  =  (F).  In  fact,  this  corresponds  to  transforming  the  original 
problem  F(X)  =  0  into  a  triangular  system 


9i{ 

Xi 

) 

=  0, 

52  ( 

Xi,  X2 

) 

=  0, 

9m{ 

^25 

•  j  ) 

=  0. 

Once  such  a  triangular  system  is  available,  the  solution  strategy  is  obvious; 
determine  the  zeros  of  the  univariate  polynomial  gi  (ici)  and  substitute  them 
into  52  ('j  X2)  which  is  now,  for  for  any  such  zero,  again  a  univariate  polynomial 
in  X2,  and  go  on  with  this  procedure.  Moreover,  such  a  triangular  basis  can 
indeed  be  computed:  Qiexy  the  reduced  Grobner  basis  for  (F)  with  respect  to 
the  lexicographical  term  order  where  xi  ^  X2  ^  ^  Xn  has  the  property 

that 

^  n  K  [rci, . . . ,  Xfc]  C  (F)jfc 


is  a  Grobner  basis  for  (F)^  (cf.  [4,  p.  114]). 
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However,  as  nice  as  this  idea  of  successive  elimination  of  variables  sounds, 
there  are  numerous  drawbacks: 


(i)  The  complexity  of  computing  a  lexicographical  Grobner  basis  is  tremen¬ 
dous,  and  even  relatively  “simple”  problems  still  exceed  the  limitations 
of  existing  computing  facilities. 

(ii)  There  are  often  several  polynomials  in  a  certain  number  of  variables,  that 
is,  the  system  is  not  as  triangular  as  one  would  want  it  to  be. 

(iii)  The  degree  of  the  polynomial  gi  is  usually  very  high.  This  makes  it 
impossible  to  compute  its  zeros  exactly. 

(iv)  The  tempting  idea  to  find  gi ’s  zeros  approximately  and  substitute  these 
values  will  not  lead  very  far  since  it  is  well-known  that  the  zeros  of  a 
polynomial  are  usually  quite  ill-conditioned  with  respect  to  its  coefficients 
(cf.  [5,17]), 


So,  the  summary  is  fairly  disappointing:  elimination  methods  do  not  provide 
a  good  tool  to  tackle  polynomial  systems  of  equations.  In  particular,  they  rely 
too  much  on  symbolic  methods  (with  exact  computations)  to  become  a  useful 
tool  in  numerical  applications. 

A  different  approach  has  been  proposed  quite  recently  by  Stetter  [16]  (see 
also  [10];  in  [7]  this  method  is  partly  attributed  to  Stickelberger)  which  is  based 
on  transforming  the  nonlinear  system  of  equations  into  an  eigenvalue  problem 
for  which  a  huge  library  of  powerful  routines  is  available.  For  that  purpose, 
let  us  assume  that  the  set  of  solutions  X  is  finite  (that  is,  the  associated  ideal 
{T)  is  zero  dimensional)  and  that  all  the  common  zeros  are  simple.  The  latter 
restriction  is  made  to  keep  the  presentation  simple;  details  on  how  to  handle 
multiplicities  can  be  found  in  [10].  We  first  note  that  for  any  /  G  II,  the 
mapping 


n/W 

[p] 


n/{r) 

[f-p] 


is  a  homomorphism  on  the  ^X-dimensional  linear  space  11/  (JF).  Now,  sup¬ 
pose  for  a  moment  that  we  know  X.  Then  there  are  polynomials  ^  n? 
X  ^  X,  defined  by 

Px  (^  )  —  ^  ^  X , 


which  form  a  basis  for  11/  (/'),  i.e.. 


n/  (T)  =  span  {  [pa:]  :  a;  G  A  }  . 


Obviously,  for  any  x  e  X,  the  polynomial  gx  —  {f  -  f{^))Px  satisfies  gx{X)  = 
0,  and  therefore 


[01  =  bx]  =  [  (/  -  f{^))Px  1  =  \px]  -  f{x)  [px]  • 


What  we  have  proved  with  this  simple  argument  is  the  following  crucial  the¬ 


orem. 
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Theorem  2.  The  polynomials  px,  x  e  X,  are  joint  eigenvectors  of  all  bomo- 
morpbisms  $/,  /  G  H,  with  respect  to  tbe  eigenvalue  f(x). 

This  result  again  suggests  a  strategy  to  solve  polynomial  systems  of  equa¬ 
tions:  compute  a  set  of  representers  for  11/  (^),  that  is,  a  finite  set  P  C  11  of 
linearly  independent  polynomials  such  that 

n/  (^)  =  span  {[p]'P^V}, 

and  compute  the  matrix  M/  which  describes  the  action  of  with  respect  to 
the  basis  V.  The  eigenvalues  of  such  matrices  yield,  when  combined  appro¬ 
priately,  the  solutions  X.  We  remark  that  the  (transpose  of  the)  matrix  Mf 
is  called  the  multiplication  table  for  /  with  respect  to  'T,  and  that  the  original 
goal  for  Buchberger’s  doctoral  thesis  (supervised  by  Grdbner)  was  not  the 
invention  of  Grobner  bases  but  the  computation  of  multiplication  tables.  Of 
course,  the  most  natural  approach  would  be  to  compute  the  multiplication 
tables  Mxj,  j  =  1, . .  •  ,n,  for  the  coordinate  functions  and  thus  compute  the 
respective  coordinates  of  the  elements  of  X  as  the  eigenvalues  of  the  multipli¬ 
cation  table.  Note  that  the  different  components  are  finally  “glued  together” 
by  the  requirement  that  they  must  correspond  to  the  same  eigenvector. 

What  we  now  have  is  the  possibility  of  reducing  the  search  for  the  solu¬ 
tions  of  a  polynomial  system  of  equations  to  an  eigenvalue  problem,  provided 
that  we  are  able  to  perform  two  operations: 

(i)  Given  a  basis  T  for  an  ideal  {T)  compute  a  basis  V  of  representers  for 

n/(/'). 

(ii)  Having  this  basis  available  and  given  any  /  G  H,  compute  the  multipli¬ 
cation  table  Mf  with  respect  to  V. 

Fortunately,  this  is  where  [12]  enters  -  the  answer  are  normal  forms:  if  H  is 
an  H-basis  for  (.F),  then  any  basis  for  NF  (H,  H)  is  exactly  the  desired  V,  and 
the  action  of  can  be  computed  by  expanding  NF  (/  ■  p^H)  for  all  p  Q  V, 
which  yields  the  multiplication  table  M f .  The  remaining  question  is  “why  H- 
bases?”,  and  this  question  is  justified  since  the  computation  of  normal  forms 
and  thus  of  multiplication  tables  is  perfectly  possible  with  the  help  of  Grobner 
bases  as  well.  To  give  a  partial  answer  to  this  question,  we  look  at  an  example. 

§4.  When  Two  Ellipses  Meet 

In  this  section  we  consider  a  simple  example  which  will  show  that  also  the 
eigenvalue  method  can  encounter  serious  obstacles,  in  particular  when  Grobner 
bases  are  involved.  The  important  thing  here  is  simplicity,  as  it  will  not  be 
too  surprising  if  extremely  complicated  and  difficult  examples  cause  problems. 

We  consider  the  two  ellipses 

9ix,y)  =  ~x‘^-h^y^-l. 
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Fig.  1.  The  two  ellipses. 


Clearly,  these  two  ellipses  intersect  in  the  four  well-separated  points  (±1,  ±1) 
as  can  be  seen  in  Fig.  1. 

Now,  we  are  going  to  perturb  g  a  little  bit  and  replace  it  by  Qfp  = 
g  (Atp(x,y))^  where  denotes  the  rotation 


cos  (f) 

sm(f) 

X 

—  sin0 

COS0 

y . 

Note  that  we  have  in  mind  small  values  of  0,  so  the  intersections  should  still 
be  close  to  (±1,±1)  and  the  problem  should  still  be  well-conditioned. 

Recalling  that  lexicographic  Grobner  bases  are  known  as  troublemakers, 
we  first  try  some  “better”  Grobner  basis,  namely  the  one  which  is  based  on 
the  graded  lexicographic  term  order  with  x  ^  y.  Note  that  this  ideal  basis  is 
not  only  a  Grobner  basis,  but  also  an  H-basis.  In  this  case  the  Grobner  bases 
Q(P  consists,  for  0  0,  of  the  three  polynomials 

4  sin  0  xy  +  3  cos  (/)  x^  -  3  cos 

+  2y2  -  3, 

cos  <!>  (cos^  0  +  8)  —  Z  cos  0  (cos^  0  +  2)  X  +  12  sin  0  (sin^  0  —  1)  y, 


while 

Go  =  -  1,2/^  -  1}  • 

Here  we  already  observe  that  some  singularity  must  appear  for  ^  =  0,  since 
Qo  is  not  just  a  limit  0  — ^  0  of  although  the  basis  changes  continuously 
with  respect  to  (j).  The  singularity  becomes  more  apparent  if  we  look  at  the 
normal  forms,  which  are 

.p  ^  f  {l,x,?/,a;2}  if  (j!)  0, 

\  {l,x,y,2:2/}  if^6  =  0. 
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Finally,  the  multiplication  tables  for  the  multiplication  by  x  take  the 

form 


-0 

1 

0 

0“ 

II 

o 

1 

0 

0 

0 

0 

0 

0 

1 

table 

.0 

0 

1 

0. 

ro 

1 

0 

0 


3cos<^ 

4sin(^ 

0 

0 

3cos<^ 

4sin0 


0 

o  5cos^  0—8 
^  cos^  <^+8 
1  o  sin  4>  cos  <t> 
cos2<^+8 

0 


^  0, 


provides  us  with  difficulties.  Not  only  does  this  matrix  not  converge  to  Mx,o 
for  (/)  — >  0,  but  some  entries  in  this  matrix  even  diverge  to  ±oo,  respectively. 
Indeed,  if  one  tries  to  compute  the  eigenvalues  and  eigenvectors  of  this  matrix 
for  small  values  of  0,  things  become  disastrous:  A  Maple  computation  with 
10  digits  worked  until  about  (j)  ~  10“^,  where  an  error  message  reported  that 
the  QR  algorithm  did  not  work.  For  smaller  values,  like  (j)  ~  10~®,  Maple 
invented  complex  zeros  with  an  imaginary  part  of  the  magnitude  0.5  x  10~^ 
which  by  far  exceeds  any  negligible  machine  number.  On  the  other  hand, 
Octave,  a  free  Matlab  clone  whose  Linear  Algebra  facilities  are  based  on 
LAPACK  [1],  reproduced  the  eigenvalues  correctly,  but  gave  eigenvectors  which 
were  practically  0. 

Hence,  we  end  up  with  some  kind  of  paradox  which  is  due  to  a  singularity 
at  0  =  0;  though  the  original  problem  of  solving  the  polynomial  system  of 
equations  is  very  well-conditioned,  the  graded  lexicographical  Grobner  basis 
is  extremely  sensitive  to  very  perturbations  (|<^|  <  10“^),  but  by  far  not 
so  sensitive  to  relatively  “large”  (|(^|  >  10“®)  perturbations. 

Similar  problems  appear  when  we  replace  the  graded  lexicographical 
Grobner  basis  by  a  purely  lexicographical  one  with  x  ^  y  which  yields  the 
normal  forms 

if  0  ^  0, 

^  \{l,x,2/,a:?/}  if  0  =  0. 

Though  the  components  of  the  multiplication  table  at  least  are  contin¬ 
uous  functions  in  0  and  remain  bounded  in  this  case,  the  limit  0  — >  0  again  is 
not  Mxfi.  But  the  multiplication  tables  with  respect  to  the  purely  lexi¬ 
cographical  Grobner  basis  is  even  worse:  its  entries  are  either  zero  or  diverge 
for  0^0. 

The  behavior  of  the  Grobner  bases  at  0  =  0  raises  the  question  of  whether 
this  singularity  is  systematic,  that  is,  intrinsic  to  the  problem,  or  if  it  is  a  repre¬ 
sentation  singularity  generated  by  the  Grobner  bases.  Systematic  singularities 
appear,  for  example,  if  several  zeros  “collapse”  into  one  multiple  zero  which 
leads  to  extremely  intricate  problems  in  the  multivariate  case  [9].  Here,  how¬ 
ever,  the  good  separation  of  the  zeros  suggests  the  conjecture  that  we  only 
face  a  representation  singularity. 


340 


H.  M.  Moller  and  T.  Sauer 


Indeed,  since  H-bases  leave  more  degrees  of  freedom,  we  can  try  another 
one  which  is  now  based  on  orthogonalization.  For  this  purpose,  we  use  the 
inner-product 

{p,Q)  =  ip{D)  q)  (0) 

and  recall  from  [11,  Theorem  5.3]  that  the  set  {f,g4>}  is  already  an  H~basis. 
Moreover,  the  normal  form  space,  which  is,  according  to  Theorem  1,  the  least 
interpolation  space,  is  spanned  by 

=  {l,rc,7/,  2sin0  -3cos(p  xy  -  sin0  y^} 

and  depends  continuously  on  0  with 


lim  Vl=Vo  =  {1,0;,  y,  xy}  . 

<p — ►O 


Then  one  can  compute  the  respective  multiplication  table  as 

0  l+£l(0)  ^2(0)  £3(0) 

1  0  0  £4(0) 

1  0  0  H-£5(0)  ’ 

0  £5(0)  1+£7(0)  £8(0)  - 

where  £j  (•),  jf  =  1, . . . ,  8,  are  continuous  functions  which  vanish  at  the  origin. 
In  particular,  — >■  Mx,o  as  a:  — >  0  and  the  computation  of  eigenvalues 

and  eigenvectors  of  can  now  be  done  with  sufficient  accuracy.  However, 
we  remark  that  the  fact  that  the  matrices  M*  ^  and  M*  ^  have  two  approxi¬ 
mately  double  eigenvalues  ±1,  requires  some  extra  care  when  connecting  these 
individual  values  in  the  final  determination  of  the  intersections. 


§5.  Summary 

We  have  given  examples  of  numerical  applications  which  can  be  reduced  to 
the  computation  of  normal  forms  with  respect  to  a  certain  polynomial  ideal, 
an  operation  which  is  usually  performed  using  a  Grobner  basis.  On  the  other 
hand,  H-bases  could  as  well  be  used  for  normal  form  computations,  and  their 
greater  flexibility  may  yield  stabilizing  effects  which  are  highly  desired  in  nu¬ 
merical  computations. 
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Dependence  Structure  of  Random  Wavelet 
Coefficients  in  Terms  of  Cumulants 


Philippe  Naveau,  Peter  Brockwell,  and  Doug  Nychka 


Abstract.  When  the  Gaussian  assumption  for  a  times  series  no  longer 
holds,  second  order  moment  properties  such  as  the  covariance  and  the 
spectrum  are  not  necessarily  sufficient  to  describe  the  dependence  struc¬ 
ture.  Although  wavelet  models  have  been  proposed  to  de-correlate  the 
signal,  this  strategy  must  be  reexamined  when  applied  to  non-Gaussian 
processes.  The  process  of  interest  is  a  continuous  parameter,  mean-squared 
continuous  real- valued  process  that  is  not  necessarily  Gaussian  or  linear. 
To  study  the  departures  from  linearity  and  Gaussianity,  we  consider  joint 
cumulants,  which  are  linear  combinations  of  higher  order  moments,  and 
their  associated  spectra.  A  specific  objective  is  to  obtain  new  expressions 
for  cumulants  of  the  random  discrete  wavelet  coefficients  instead  of  the 
second  order  moments,  and  to  study  their  higher  order  polyspectra.  Con¬ 
ditions  on  the  polyspectrum  to  give  null  wavelet  cumulants  within  and 
across  wavelet  coefficient  levels  are  derived.  Expressions  of  the  original 
cumulants  as  a  function  of  the  wavelets  cumulants  are  also  given. 


§1.  Introduction 

The  covariance  and  spectral  properties  of  the  discrete  wavelet  coefficients  for 
random  continuous  real-valued  processes  have  been  extensively  studied  in 
the  past.  Among  others,  Donoho  et  al.  [4],  Flandrin  [6],  Mallat  et  al.  [13], 
Masry  [14],  and  Walter  [17]  have  investigated  the  correlation  within  and 
across  wavelet  coefficients.  Focusing  exclusively  on  the  second  order  prop¬ 
erties  of  wavelet  coeflficients  for  a  Gaussian  process  is  a  reasonable  task  since 
the  dependence  structure  of  Gaussian  processes  is  entirely  characterized  by 
the  covariance.  When  the  normality  assumption  no  longer  holds,  higher  order 
cumulants  are  necessary. 

Exploring  some  of  the  links  that  exist  between  wavelets  and  cumulants 
is  fairly  new.  Brillinger  [1]  studied  a  non-parametric  regression  problem  with 
cumulants  and  wavelets.  In  geophysics  and  astrophysics,  Lazear  [11]  and 
Ferreira  et  al.  [5]  applied  wavelets  and  cumulants  to  seismic  data  sets  and 
to  the  Cosmic  Microwave  Background  problem.  The  dependence  structure 
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between  wavelet  packets  and  in  particular  wavelet  coefficients  has  also  been 
studied  by  D.  Leporini  and  J.C.  Pesquet  [12].  In  this  paper,  we  derive  new  and 
general  results  about  the  dependence  structure  of  random  wavelet  coefficients 
via  its  cumulants. 

The  process  of  interest  X(i),  indexed  by  the  real  parameter  t,  is  supposed 
to  be  a  continuous  real-valued  process  that  is  mean-square  continuous,  and 
such  that  moments  of  some  order  /  >  2  exist,  i.e., 

sup£|X(t)|' <  00  and  lim  +  ft)  -  Xmp  =  0.  (1) 

t  h-*0 

§2.  Cumulant  Definition  and  Properties 

If  some  useful  information  of  the  signal  is  not  contained  in  the  second-order 
covariances  (and  the  second  order  spectra),  then  one  can  still  calculate  some 
meaningful  linear  combination  of  higher  order  moments,  called  cumulants. 
Some  early  work  on  higher  order  cumulants  and  their  Fourier  transform  was 
proposed  by  Hasselman  et  al.  [8]  for  investigating  nonlinear  interaction  of 
ocean  waves,  and  Godfrey  [7]  used  it  for  the  analysis  of  economic  time  se¬ 
ries.  Rosenblatt  with  Lii  and  Van  Atta  in  a  series  of  papers  have  described 
how  higher  cumulants  could  be  used  to  study  nonlinear  transfer  of  energy  in 
turbulence. 

The  joint  cumulant  of  the  set  of  random  variables  {A(ii), ..., 
denoted  by  CUM{X{ti),  with  m  <  /,  is  given  by 

CUMiX{h),...,X(t^))  =  Y^(-ir{p-  !)!(£  J]  X{Q)...{E  J] 

rCvi  r€vp 

where  the  summation  extends  over  all  partitions  {fi,  of  {1,  ...,m}  with 

p  =  l,...,m.  From  this  definition,  we  can  notice  that  the  information  con¬ 
tained  in  the  first  m  cumulants  is  exactly  the  same  as  that  contained  in  the 
first  m  moments.  However,  cumulants  have  some  advantages  over  moments. 
For  example,  cumulants  have  useful  linear  properties, 

CUM{Z  +  Xih), X{t^))  =  CUM{Z, X(<„))  +  CUM{X{ti), X(i„)), 
CUM{aX{ti),...,Xit„))  =  aCUM{X{ti),...,X{tm)), 

for  any  real  a.  Another  important  property  of  cumulants  concerns  the  depen¬ 
dence  structure  of  the  process:  if  some  subset  of  {A’(ii), ...,  is  inde¬ 

pendent  of  the  remainder,  then  CUM(X{ti)^  ...,X{tm))  is  identically  equal  to 
zero.  Hence,  the  cumulant,  CUM{X{ti), X{tm))  can  be  interpreted  as  a 
measure  of  dependence  of  {A’(ti),  ...,X(fm)}.  For  the  special  case  of  Gaussian 
processes,  cumulants  of  order  higher  than  two  are  zero. 

In  the  remainder  of  this  section,  we  suppose  that  the  process  {A’(f)}  is 
stationary  up  to  order  I,  i.e., 


E{X{to)X{h)...X{ti))  =  E{X{to  +  h)X{ti  +  h)...X{ti  +  h)),  V/i. 
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Stationarity  as  just  defined  is  frequently  referred  to  in  the  literature  as  weak 
stationarity.  For  us  however  the  term  stationarity,  without  further  qualifica¬ 
tion,  will  always  refer  to  the  above  equality. 

Under  stationarity,  CUM{X{t),X(t  -h  si),  ...,X(t  -f  Sm-i))  does  not  de¬ 
pend  on  t,  and  can  be  denoted  by  7771(51, 5771-1).  With  these  notations,  the 
second  order  cumulant  71  (u)  is  just  the  covariance  function.  The  third  order 
cumulant  72(14,  is  the  same  as  the  third-order  central  moment, 

E(iX{t)  -  ti)(X(t  +  u)-  p){X{t  -hv)-  m)), 

where  p  is  the  mean  value  of  the  process. 

Prom  the  covariance  function,  one  may  define  the  power  spectrum,  i.e., 
the  Fourier  transform  of  71  (t),  =  f  jiit) €xp(-iujt)dt.  A  natural  exten¬ 

sion  is  the  77i*^-order  polyspectrum  defined  by 

m—l 

7jti(^15  •••!  — 1)  GXp(— i  E  coj  tj  )dt  1 .  ..dt'fj^ — 1 , 

j=i 

assuming  that  the  above  Fourier  transforms  exist.  An  important  property  of 
the  polyspectra  is  that  all  polyspectra  of  higher  order  than  second  order  vanish 
when  {A'(t)}  is  a  Gaussian  process.  Another  characteristic  of  the  polyspectra 
is  that  the  ratio 

|/2(c^l,^2)| 

/l(wi)/l(^2)/l(^^l  +^^2) 

is  constant  whenever  the  process  {X{t)}  is  linear.  Hence,  the  simplest  higher 
order  spectrum,  called  bispectrum,  can  be  regarded  as  deviation  measures  from 
Gaussianity  and  linearity.  Different  statistical  tests  have  been  derived  from  it 
(see  Subbua  Rao  and  Gabr  [9]  and  Hinich  [16]). 

§3.  Random  Wavelet  Coefficients 

Consider  a  discrete  orthonormal  wavelet  decomposition  of  a  stochastic  process 
{X(t)}  that  satisfies  condition  (1).  The  corresponding  wavelet  coefficients 

Wj,k  =  j  (2) 

are  random  variables.  Here  the  equality  sign  is  to  be  understood  in  the  mean- 
square  sense,  and  '0i,fc(O  ~  —  k)  is  an  orthonormal  wavelet  basis 

function  with  the  mother  wavelet  -0.  A  rigorous  framework  concerning  the 
construction  for  wavelet  orthonormal  basis  can  be  found  in  Meyer  [15]  and 
Daubechies  [3].  There  exist  many  candidates  for  the  mother  wavelet.  The 
simplest  example  of  an  orthonormal  wavelet  basis  is  provided  by  the  Haar 
system  for  which 

11,  if  0  <  a;  <  0.5, 

-1,  if  0.5  <  a:  <1, 

0,  otherwise. 
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Our  main  interest  is  to  understand  the  dependence  structure  between  the 
different  wavelet  coefficients  defined  by  (2). 

In  this  article,  we  will  either  use  the  Haar  system  as  a  simple  example 
of  compactly  support  wavelets,  or  a  particular  type  of  band-limited  wavelets 
called  Meyer-type  wavelets  (see  Walter  [17],  Zayed  and  Walter  [18]).  Having 
a  compact  support  in  the  frequency  domain  can  facilitate  the  computation  of 
wavelet  cumulants.  These  Meyer- type  wavelets  have  some  additional  attrac¬ 
tive  features,  such  as  being  highly  smooth  (they  can  be  made  C°^)  and  having 
fast  decay  in  the  time  domain.  They  are  introduced  as  follows. 

Let  F  be  any  probability  measure  supported  on  [— e,e]  for  some  e  <  tt/S. 
Then  the  mother  wavelet  ^(•)  is  defined  by  its  Fourier  transform 

^(cj)  ==  exp(-ia;/2)[  /  dF]^/^  (3) 

J  [a;/2|  — TT 

Prom  this  definition,  it  is  possible  to  check  that  the  orthogonality  and  dila¬ 
tion  conditions  are  satisfied  for  the  wavelet  basis  generated  from  this  mother 
wavelet.  There  is  a  large  class  of  distributions  that  can  be  chosen  in  equality 
(3),  and  the  edges  of  the  support,  [-e,  e],  can  be  made  highly  smooth. 

§4.  Dependence  Structure 

The  dependence  structure  between  wavelet  coefficients  is  closely  related  to  the 
dependence  inside  the  original  signal.  Hence,  our  first  problem  is  to  explain 
how  to  obtain  the  joint  cumulants  of  the  wavelet  coefficients  from  the  joint 
cumulants  of  the  process.  The  first  proposition  takes  care  of  this  problem. 

Because  of  space  limitations,  the  proof  of  our  propositions  will  not  be  in¬ 
cluded  in  this  paper.  However,  complete  details  of  the  proofs  can  be  requested 
from  the  authors. 

Proposition  1,  Let  {X{t)}  be  a  stochastic  process  that  satisfies  condition 
(1).  Suppose  that  the  joint  cumulants  of  order  m  <  I  of  {^(t)}  exist.  Then 

m 

n=:l 


Proposition  1  is  directly  applicable  to  compactly  supported  wavelets, 
since  the  product 

m 

JJ  V’jn./Cn  (^n) 

n=l 

is  null  except  at  the  intersection  of  the  translated  and  dilated  supports.  For 
example,  suppose  that  the  wavelet  basis  corresponds  to  the  Haar  system.  The 
expression  of  cumulants  of  the  wavelet  coefficients  becomes 

/  =  1 
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The  Ai  are  rectangular  boxes  defined  by  the  tensor  product 

Ai  =  ^n,Cnn 

n=l,..,m 


where  the  vectors  q  =  {cu, ,  ...jCmi)  represent  the  set  of  all  2”*  possible  con¬ 
figurations  of  {0, 1/2}"^,  and  the  interval  In,c„i  is  defined  by 

=  On  :  2"^"(fcn  +  c„,)  <  tm  <  +  Cni  +  1/2),  for  n  =  1, 


In  order  to  apply  (4)  to  a  more  specific  example,  we  suppose  that  the  process 
{X(t)}  is  a  zero-mean  stationary  process  with  standard  deviation  cr  and  covari¬ 
ance  7i(/i)  =  6exp{-a|h|),  and  7i(/ii,/i2)  =  cexp(-a|/ii  4-/12!),  where  a,6, c 
are  constants  that  depends  on  the  second  and  third  moment  and  other  param¬ 
eters  describing  the  original  process.  Processes  with  such  a  cumulant  function 
correspond  to  Continuous  Auto-Regressive  processes  (CAR)  (see  Brockwell 
[2])  or  equivalently  solutions  of  particular  stochastic  differential  equations  with 
non-necessarly  Gaussian  noise.  After  some  algebra,  wavelet  cumulants  sim¬ 
plify  to 


hJCfi  (j  1 )  fci ) 


[  XI  Hj,[uia)Hj^[u2a)], 


for  2  ^^{k2  4- 1)  <  2  and 


x[  X  Hj^{'^ia)Hj^{u2a)Hj^{u3a)] 


for  4- 1)  <  min(2“'/^fci, 2~-/2/c2)  with  Ka{j,k)  =  exp(a(2“-^ (/c  4-  0.5))) 

and  Hj{a)  =  1  —  exp(a2“'/“^).  The  previous  formulas  can  be  easily  extended 
to  higher  dimensions,  and  can  be  used  to  derive  asymptotic  behavior,  e.g 
lii  ~  jil  T  00  and  so  on. 

Another  possible  application  of  Proposition  1  is  to  non-stationary  pro¬ 
cesses.  A  large  variety  of  models,  such  as  the  bilinear  model,  autoregressive 
models  with  random  coefficients,  and  the  threshold  model,  have  been  pro¬ 
posed  to  take  into  account  of  the  non-stationarity.  To  illustrate  the  use  of 
cumulants,  we  restrict  attention  to  piecewise  stationary  processes,  i.e.  a  sum 
of  independent  stationary  processes: 

r 

<t<  ui+i)X^^\t),  where  X{A)  = 

and  are  independent  stationary  processes  and  the  change-points  are 

equal  to  —00  =  uq  <  ui  <  ■  •  ’  <  Ur  <  Wr+i  =  00.  Because  of  linear  properties 
of  the  cumulants,  we  have  immediately  that 

r  m 

CUM{X{ti),...,X{tm))  = 


r  1,  ate  A 

\  0,  otherwise, 
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where  CUM{X^^\t))  =  CUM{X^^\ti),...,X^^^tm))-  Using  Proposition  1,  it 
follows  that 


r  -  m 

CUM(Wj^)  =  CUM(X(')(i))  2  <  n+i)tu.Kitn)dt„ 

l=l  ''  n=l 

with  CUM(Wjf.)  =  CUM{Wj^ki,-”,Wj^k^).  The  above  expression  shows 
that  wavelet  cumulants  for  piecewise  stationary  processes  can  be  easily  com¬ 
puted  for  compactly  supported  wavelets  such  as  the  Haar  system,  and  when 
each  process  X^^^  has  simple  cumulant  functions  (e.g.  the  CARMA  process). 

From  the  CAR  example,  we  saw  that  wavelet  cumulants  are  computable 
for  the  Haar  system,  but  the  resulting  formula  are  not  so  easy  to  manipu¬ 
late.  Another  approach  is  to  use  band-limited  wavelets.  Simpler  expression 
of  the  wavelet  cumulants  can  be  derived.  To  illustrate  this  point,  we  look 
at  the  Meyer-type  wavelet  in  the  next  proposition.  In  this  case,  the  Meyer- 
type  wavelet  gives  null  or  small  wavelet  cumulants  within  and  across  wavelet 
coefficient  levels  under  simple  conditions. 

Proposition  2.  Let  {A(t)}  be  a  stationary  process  that  satisfies  condition 
(1).  Suppose  that  its  m}^-order  polyspectrum  fm  is  well  defined,  and  the 
orthonormal  basis  {V'jit}  is  generated  by  a  mother  Meyer-type  wavelet.  If 
there  exists  some  integer  j*  in  such  that  ^ 

CUM{Wj^ki,  •••)  —  0-  I^  addition,  if  fm(oj)  and  ^(a;)  are  both  in  C^, 

then  the  cumulant  at  a  hxed  resolution  level  satisfies 

CUM{Wj, =  0(max  J]  |fc,  -  fc,]-”). 

s^r 


Proposition  2  shows  that  the  Meyer-type  wavelet  transform  not  only  can 
remove  the  correlation  inside  the  original  signal,  but  in  addition  the  higher- 
order  cumulants  are  either  null  at  distant  scales  or  very  small  at  a  fixed  scale. 
Walter’s  result  [17]  for  the  covariance  is  a  special  case  of  Proposition  2: 


CUM{Wj,k,,Wj,k,) 


0,  for  lii -;2|  >  1, 

0{\ki  ~  k2\^^),  forji=j2. 


It  is  interesting  to  note  that  the  results  stated  in  Proposition  2  hold  for  any 
choice  of  F  in  the  definition  of  the  Meyer- type  wavelet.  An  open  problem 
is  to  determine  if  there  exist  some  distributions  F  which  will  significantly 
reduce  cumulants  between  wavelet  coefficients.  In  this  direction,  Zayed  and 
Walter  [18]  minimized  the  covariance  between  wavelet  coefficients  by  using  a 
bi-orthonormal  wavelet  basis  that  is  a  function  of  the  original  covariance. 

In  Propositions  1  and  2,  different  expressions  of  the  wavelet  cumulants 
were  derived.  A  natural  question  is  whether  or  not  the  cumulants  of  the 
original  process  can  be  expressed  in  terms  of  CUM{Wj^ki,  Thus, 

our  next  result  is  the  converse  of  Proposition  1. 
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Proposition  3.  Suppose  that  {Wjk}  is  a  sequence  of  variables  with  all  Unite 
moments.  If  the  process 


oo  oo 

E  E  WikfAt) 

j——oo  fc=— oo 

is  well  dehned  (in  the  mean-square  sense),  then  we  have 

oo  oo  m 

CT;M(X(t, X(U))=  -  E 

ji,fci  =  -oo  j^,km=-oon=l 


§5.  Conclusion  and  Future  Work 

In  this  paper,  different  relationships  between  wavelets  and  cumulants  have 
been  presented.  Results  show  that  the  wavelet  transform  is  not  only  a  good 
tool  to  de-correlate  a  Gaussian  process,  but  it  also  gives  small  higher-order 
cumulants  of  a  non-Gaussian  signal.  The  Meyer-type  wavelet  is  particularly 
well-adapted  for  stationary  processes  since  they  give  null  wavelet  cumulants 
at  distant  scales. 

The  combination  of  wavelets  and  cumulants  has  not  yet  been  fully  ex¬ 
ploited.  The  statistical  study  of  estimators  of  the  bispectrum  based  on 
wavelets  is  of  particular  interest  for  application  with  real  data  sets.  Also  in¬ 
vestigating  the  properties  of  non-linear  and  non-stationary  times  series  models 
using  bi-spectral  methods  and  a  wavelet  decomposition  approach  needs  further 
research. 
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Interpolating  Functions  on  Lines  in  3-Space 


Martin  Peternell  and  Helmut  Pottmann 


Abstract.  Given  straight  lines  Li,  i  =  in  Euclidean  3-space 

with  associated  function  values  fi,  we  study  the  interpolation  problem  of 
constructing  a  smooth  real  valued  function  F  which  interpolates  values  fi 
at  given  data  lines  Lj.  The  function  F  shall  be  defined  on  the  entire  set 
of  lines  or  at  least  on  lines  contained  in  a  domain  of  interest  in  3-space. 


§1.  Introduction 

The  problem  of  constructing  an  interpolating  function  F  for  data  lines  Li  and 
corresponding  function  values  fi  is  a  scattered  data  Interpolation  problem  in 
the  set  of  lines  C  in  Euclidean  3-space  E^. 

A  variety  of  solutions  of  scattered  data  interpolation  problems  for  data 
points  Xi  e  U  with  U  =  R,”  or  P  C  R”  are  known,  see  [3].  Extensions  to 
spheres  and  other  surfaces  in  R^  are  described  in  [2]  and  references  therein. 

Scattered  data  interpolation  on  lines  is  quite  different,  since  the  set  of  lines 
C  is  not  a  Euclidean  space.  It  is  a  result  of  classical  geometry  that  the  set  of 
lines  C  of  projective  extension  of  Euclidean  3-space  is  a  4-dimensional 
quadratic  variety  in  projective  Thus,  the  general  formulation  of  the 
problem  is  as  follows:  Construct  a  function  F  :  >  R  interpolating  val¬ 

ues  fi  to  corresponding  data  lines  Li.  For  practical  purposes  it  is  sufficient 
to  construct  (or  represent)  functions  on  subsets  of  which  correspond  to 
domains  of  interest  in  E^,  containing  all  data  lines. 

The  solution  presented  here  will  be  the  following.  We  restrict  to  specific 
four-dimensional  subsets  Cq  of  .  These  subsets  possess  parametrizations 
R^  — >  Cq  with  the  property  that  distances  between  lines  in  Cq  are  induced 
by  special  positive  quadratic  forms  in  R'^.  This  fact  allows  us  to  apply  well- 
known  methods  in  R'^  to  solve  interpolation  (or  also  approximation)  problems. 

Applications  include  light  field  rendering  in  computer  graphics  [4].  Con¬ 
sidering  motion  planning  in  robotics,  the  method  applies  to  represent  a  dis¬ 
tance  function  of  robot  arms  (lines)  to  obstacles.  The  first  motivation  for 
studying  functions  on  lines  came  from  five  axis  milling.  There,  the  question 
occurs  of  how  to  represent  axis  positions  (lines)  of  the  cutting  tool. 
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§2.  Lines  in  Space 

An  oriented  line  L  in  Euclidean  3-space  is  determined  by  a  point  p  and  a 
unit  direction  vector  1  (||1||  =  1).  Together  with  the  moment  vector 

l  =  pxl,  (1) 

we  obtain  a  representation  of  L  by  a  sixtuple 

l  =  (1;I)  =  (;i,/2,/3;^4,/5,W.  (2) 

These  Ws  are  called  normalized  Pliicker  coordinates  of  L.  By  (1),  these  coor¬ 
dinates  are  not  independent,  but  satisfy  the  Pliicker  relation 


1*1  —  /1/4  -|-  /2?5  T  ^3^6  —  0-  (3) 

Substituting  1  by  -1  leads  to  coordinate  vector  -L  which  defines  the  same  line 
but  with  opposite  orientation.  To  get  more  information  about  the  structure 
of  lines  in  space,  it  is  necessary  to  study  the  set  of  lines  C  in  the  projective 
extension  P^  of  E^. 

E^  is  extended  to  P^  by  adding  points  and  lines  at  infinity.  Using  the 
analytical  model  points  in  P^  are  one  dimensional  subspaces  of  IR'^.  Thus, 
we  will  use  the  following  notation  for  points  in  P^, 


(xo,xi,a:2,a^3)lR  (Aa:o, . . . ,  Aa^a),  A  e  K. 

Let  a;  :  a;o  =  0  be  the  plane  at  infinity.  We  write  briefly  (xq,  x)lR,  with  x  e  IR^ 
for  points  in  P®.  The  transition  from  homogeneous  to  Cartesian  coordinates 
is  given  by 

.  ,X\  X2 

\^Xq,  Xi,  X2,  X^j  I  >  V  j  )  jj 
Xq  Xq  Xq 


which  is  obviously  only  possible  for  points  not  at  infinity. 

A  line  L  in  P^  usually  is  spanned  by  two  points  (po,p)lR  and  (go,q)lR- 
Homogeneous  Pliicker  coordinates  are  obtained  by 


L  =  (^1 , . . . ,  /e)  =  (Poq  -  goP,  P  x  q)-  (4) 

If  we  substitute  (po?p)  by  A{po5p)j  we  get  AL  such  that  the  /j’s  are  only 
determined  up  to  a  scalar  multiple.  This  proves  homogeneity  of  L. 

If  L  is  not  in  cu,  the  relation  to  definition  (2)  is  obtained  as  follows. 
Let  (pojP)IR  be  a  proper  point  on  L  such  that  we  can  switch  to  Cartesian 
coordinates  p  by  letting  po  —  1.  Further,  let  (go,q)lR  be  the  intersection 
point  u)  n  L  which  implies  go  =  0.  Inserting  this  into  (4)  gives  (2)  up  to  a 
normalization  of  the  direction  vector  q  =  1  of  the  line  L. 

If  L  is  in  w,  its  Pliicker  coordinates  are  (o,  a)lR  with  o  =  (0, 0, 0)  and 
some  not  vanishing  vector  a.  We  can  interprete  L  as  the  line  at  infinity  of  a 
pencil  of  parallel  planes  a  •  x  =  c,  with  c  E  IR-  All  these  planes  possess  a  as 
normal  vector. 
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Fig.  1.  Stereographic  projection  of  a  hyperboloid  Q. 

Since  L  and  AL  define  the  same  line  in  homogeneous  Pliicker  coordi¬ 
nates  (4)  define  points  LK  in  P®.  But  only  those  6-tuples  (a:i, . . .  ,X6)IR  are 
Pliicker  coordinates  of  a  line  X  in  P^,  which  satisfy 

3/2^5  ”1”  ^3^6  —  0* 

This  quadratic  variety  is  called  the  Klein  quadric  ,  where  upper  and  lower 
indices  denote  dimension  and  degree  of  this  variety.  The  maximal  dimension 
of  its  subspaces  is  2.  It  is  a  point  model  of  the  set  of  lines  C  of  P^.  The 
bijection 

from  lines  L  C  to  points  LIR  of  is  called  Klein  mapping. 

The  image  points  (o,  a)]R  of  lines  at  infinity  lie  in  the  plane  :  xi  = 
X2  =  X3  =  0  which  is  entirely  contained  in  All  lines  passing  through 
the  origin  O  =  (1,0,0,0)11  have  Pliicker  coordinates  L  =  (1,  o).  This  can  be 
checked  by  letting  p  =  (0, 0, 0)  in  formula  (1).  The  corresponding  image  points 
in  P^  lie  in  the  plane  Eo  X4  =  X5  =  xq  =  0.  In  general,  all  lines  through  an 
arbitrary  point  in  P^  possess  7-images  which  lie  in  a  2-dimensional  subspace 
of  M2 .  The  same  holds  for  lines  contained  in  an  arbitrary  plane  in  P^.  Thus, 
M2  contains  two  3-parametric  families  of  2-dimensional  subspaces. 

We  emphasize  that  C  and  C  are  not  Euclidean,  affine  or  projective  spaces. 

Local  coordinates  of  lines 

We  have  seen  that  C  is  isomorphic  to  M^  —  where  Pu,  consists  of 
image  points  of  all  lines  at  infinity.  Let  T  be  the  tangent  hyperplane  of  M^ 
at  a  point  Z  and  let  r  =  M^  fl  T.  It  is  known  that  r  is  the  7-image  of  all  lines 
intersecting  the  line  L  =  Z'y~^. 

Lemma  1.  M^  —  t  =  A^  is  an  affine  space. 

Proof:  This  lemma  is  a  result  of  classical  geometry,  and  is  proved  by  stere¬ 
ographic  projection.  Let  Q  be  a  regular  quadric  in  P^.  Let  P  be  a  point  in 
Q  and  T  its  tangent  hyperplane,  see  Figure  1.  Further,  consider  P  to  be 
a  hyperplane  in  P",  not  incident  with  Z.  The  intersection  r  =  Q  (iT  is  a 
quadratic  cone  with  vertex  Z.  The  intersection  e  —  P  D  T  is  a  hyperplane 
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Fig.  2.  Local  coordinates,  distance  function. 

in  E.  This  says  that  E  -  e  is  an  affine  space.  The  stereographic  projection 
cr:Q  —  r— >  E  —  e  with  center  Z  is  bijective  and  maps  points  P  £  Q  —  r  to 
points  P'  in  affine  space  E  —  e.  □ 

Figure  1  shows  a  low  dimensional  example.  Q  is  a  hyperboloid,  and  r  is 
a  pair  of  lines.  Planes  E  and  T  are  parallel  such  that  e  is  at  infinity. 

We  come  back  to  line  geometry  and  the  Klein  quadric  Let  Z  — 

(0,0, 0,0, 0, 1)]R  be  the  center  of  a  stereographic  projection.  It  is  the  7-image 
of  the  line  at  infinity  which  is  determined  by  horizontal  planes  =const.  with 
normal  vector  (0,0, 1).  The  tangent  hyperplane  T  at  Z  with  respect  to 
is  given  by  the  equation  =  0.  The  exceptional  set  r  =  D  T  consists 
of  7-images  (/i,  ^2, 0; . .  .)1R  of  all  horizontal  lines.  Lemma  1  says  that  all  non- 
horizontal  lines  form  an  affine  space  A^. 

Consider  two  horizontal  planes  Eq  :  z  =  0  and  Ei  :  z  ^  1.  The  inter¬ 
section  points  go  =  (5i,p2j0)  and  gi  =  (p3,p4,l)  of  a  line  G  and  planes  Ei 
(Figure  2)  define  a  parametrization  of  all  non-horizontal  lines  by 

=  1R2  X  ]r2 

(91,92,93,94)  ^  G. 

Pliicker  coodinates  of  G  are  G  =  (gs  -  91,94  —  92,  ^',92,  —9i,9i94  -  9293)-  The 
stereographic  projection  with  center  Z  onto  =  0  gives 

G'  =  (p3  -  91,94  -  92,^\92,~9i,^)- 

This  equals  (5)  up  to  a  linear  mapping.  Hence,  the  mapping  (5)  from  non¬ 
horizontal  lines  to  points  in  IR'^  is  geometrically  equivalent  to  a  stereographic 
projection  of  —  r. 

Distance  function  of  lines 

For  practical  purposes,  it  is  sufficient  to  consider  distances  of  lines  within  a 
domain  of  interest.  To  specify  this  domain,  we  will  consider  only  lines  which 
enclose  an  angle  <  (pQ  with  a  fixed  unit  vector  z.  The  unit  direction  vector  g 
of  such  a  line  G  satisfies 


g  •  Z  >  cos  00- 
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We  have  chosen  a  Cartesian  coordinate  system  with  z  as  third  axis.  Further, 
we  will  consider  only  segments  of  lines  between  two  planes  Eo,Ei,  bounding 
the  domain  of  interest.  This  is  motivated  by  the  fact  that  we  are  interested  in 
particular  in  distances  between  points  lying  between  those  planes.  Let  gi,hi 
be  intersection  points  of  lines  G,H  with  Ei,  and  consider  points  x,y  on  G 
and  H,  respectively, 

x(t)  =  (l-i)go+^gi, 

y{t)  =  (1  —  t)ho  +  thi. 

The  square  of  a  useful  distance  between  lines  G,  H  within  the  above  domain 
of  interest  is  defined  by 

Jo  (7) 

=  (go  -  ho)^  +  (gi  -  hi)^  +  (go  -  ho)  •  (gi  -  hi). 

It  measures  horizontal  distances  between  corresponding  points  x,  y  of  G,  H. 
We  will  not  distinguish  between  a  line  X  and  its  coordinate  vector  X  = 
{xi,X2tXs,X4)  in  H'*  according  to  parametrization  (5).  Formula  (7)  is  a  posi¬ 
tive  definite  quadratic  form  in  IR^  with  the  following  coordinate  representation 

{X,  X)  =  xl^  xl  +  xl-\-  x\-\-  xix-i  -h  X2X4. 

Remark  2.  These  distances  differ  from  orthogonal  distances  (from  a  point  to 
a  line  G)  only  by  a  factor  <  cos^q.  So,  taking  (Jjq  relatively  small  will  control 
the  difference  between  these  distances  and  the  Euclidean  distances  in  E^. 

Summary  3.  The  restriction  to  specihc  subsets  Co  of  line  space  allows  para- 
metrizations  £0-  A  positive  definite  quadratic  form  in  IR^  serves  to 

deSne  distances  between  lines  in  a  useful  manner. 


Choice  of  local  coordinates 

Distance  d  is  not  invariant  under  motions  in  E^,  but  depends  on  the  choice 
of  z  and  planes  Eq,Ei.  Consider  oriented  lines  Li,i  =  with  unit 

direction  vectors  If.  Assume  that  Ij  •  U  <  G.  This  expresses  that  the  angle 
between  any  two  lines  is  bounded  by  arccos(G).  A  good  choice  for  the  vector 
z  can  be  computed  as  solution  of  a  regression  problem.  Assuming  ||li||  =  1, 
we  want  to  maximize 

y^(ii  •  zf  (8) 

i=l 

over  all  unit  vectors  z.  Maximizing  the  quadratic  form  (8)  under  the  quadratic 
side  condition  z  •  z  =  1  leads  to  an  eigenvalue  problem  in  IR^.  Thus,  we 
found  a  possibility  to  construct  z  with  respect  to  a  set  of  lines  Li.  Planes 
Eq,  El  perpendicular  to  z  bounding  the  domain  of  interest  have  to  be  chosen 
depending  on  the  problem.  In  this  sense  we  can  say  that  the  coordinate  system 
is  connected  with  the  problem  in  an  invariant  way. 
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Fig.  3.  Definition  of  domains. 

If  direction  vectors  li  of  lines  Lj  are  distributed  over  a  whole  hemisphere  or 
more,  we  have  to  split  the  set  of  lines  into  subsets  and  perform  the  construction 
of  coordinate  systems  for  the  subsets.  Remark  2  gives  information  about  the 
deviation  of  distances  compared  to  usual  distances  in  E^. 

§3.  Representation  of  Functions  on  C 

Given  N  lines  Li  with  corresponding  function  values  /j,  we  would  like  to 
compute  a  function  F  :  £  — >  IR  with  F{Li)  =  fi.  This  is  a  scattered  data 
interpolation  problem  on  C,  (or  M^).  With  help  of  local  parametrizations  we 
obtain  scattered  data  interpolation  problems  on  IR'^.  The  given  algorithm 
consists  of  three  steps. 

1)  Find  a  covering  {Uj,j  ~  of  C  with  domains  Uj  which  are 

parametrized  over  IR^.  Decide  the  membership  of  lines  and  domains. 

2)  Compute  partial  solutions  Fj  of  the  interpolation  problem  for  all  domains 

Uj. 

3)  Merge  all  partial  solutions  Fj  in  a  global  solution  F  with  required  conti¬ 
nuity. 

First  of  all  we  want  to  find  a  covering  of  lines  Li  by  domains  Uj  with  1  < 
j  <  M.  We  choose  M  unit  vectors  Zj  and  real  numbers  Rj  which  serve  as 
centers  and  spherical  radii  of  caps  of  the  unit  sphere  S^.  These  caps  determine 
domains  Uj  in  the  following  way.  A  line  L  belongs  to  Uj  if  and  only  if 

1  •  z j  >  cos  Rj 

holds  for  its  direction  vector,  see  Figure  3.  Clearly,  L  can  be  contained  in  more 
than  one  domain.  We  determine  the  membership  of  all  lines  Lj  for  domains 
Uj. 

In  a  second  step  we  compute  partial  solutions  Fj  of  the  interpolation 
problem  for  each  domain  Uj.  This  is  done  by  letting 

Ni 

Jk=l 


Functions  on  Lines 


357 


where  Nj  shali  be  the  number  of  lines  Li  belonging  to  domain  Uj.  X  = 
{xi,X2,  xs,X4)  G  is  coordinate  vector  of  a  line  X  according  to  parametriza- 
tion  (5).  Bk{X)  are  (for  instance)  radial  basis  functions  and  depend  only  on 
the  distance  d(X,  Lk)~  The  coefficients  ajk  are  solutions  of  linear  systems.  The 
problem  of  regularity  of  such  systems  dependent  on  the  type  of  basis  function 
is  solved  in  [5] .  So  we  get  partial  solutions  Fj  valid  in  domains  Uj . 

In  the  last  step  we  have  to  merge  all  partial  solutions  to  a  unique  one. 
This  can  be  done  by  forming  a  weighted  sum 

M 

F{X)  = 

j  =  l 

The  weights  can  be  chosen  as 

^  ^  (1  -  arccos(x  ■ 

Efei(l-arccos(x.m,)/i?,r+’ 

where  nij  and  Rj  are  center  and  radius  of  the  spherical  cap  which  defines  Uj 
and  X  denotes  the  normalized  direction  vector  of  the  line  X.  The  notation 
(g)!j.  expresses  that  Wj(X)  is  positive  in  the  interior  of  Uj  and  is  zero  outside. 
This  says  that  (g)!j.  =  for  positive  g,  and  {q)^  =  0  otherwise. 

Weights  Wj{X)  are  in  the  differentiability  class  If  partial  solutions 

Fj  possess  the  same  smoothness,  then  also  F  is  in 

§4.  Visualization  of  Functions  on  Lines 

Since  the  dimension  of  C  is  four,  visualization  of  function  values  is  an  advanced 
topic.  In  general,  displaying  functions  on  low  dimensional  subsets  seems  to  be 
promising.  We  decided  to  choose  several  bundles  of  lines  for  evaluation  and 
want  to  describe  two  methods  of  visualization. 

We  choose  an  appropriate  number  of  points  within  the  domain  of 
interest,  and  evaluate  F  at  sufficiently  many  lines  passing  through  vertices 
Vj.  Let  Fmax  be  an  (existing!)  upper  bound  of  the  absolute  function  values. 
Consider  lines  Lij  with  function  values  F(Lij)  =  fij  passing  through  vertex 
Vj.  Assume  that  Lij  are  oriented  lines.  Displaying  the  star-shaped  surfaces 

Pi  =  Vi  +  (1  +  ■^)lij 

for  all  chosen  vertices  Vj  is  one  possibility  to  visualize  function  values.  If 
function  values  for  L  and  -L  are  equal,  the  pi  will  be  centrally  symmetric 
surfaces.  For  functions  on  nonoriented  lines,  we  will  use  both  direction  vectors 
lij  and  —lij  for  the  definition  of  Pi,  and  assign  the  same  function  value  fij  to 
them.  Thus  we  always  get  centrally  symmetric  surfaces.  Figure  4  shows  an 
interpolant.  The  test  function  is  a  function  of  the  distances  between  lines  Li 
and  points  (not  displayed). 

For  the  second  method  we  use  spheres  Si,  centered  at  vertices  Vi.  All 
lines  Lij  of  the  bundle  Vi  with  constant  function  values  form  a  cone  C  with 
vertex  Vi.  Intersecting  these  cones  C{ci)  for  several  constants  Ci  gives  level 
curves  on  spheres  Si  (not  displayed). 
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Fig.  4,  Visualization  of  functions  on  lines. 
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Numerical  Techniques  Based  on 
Radial  Basis  Functions 

Robert  Schaback  and  Holger  Wendland 


Abstract.  Radial  basis  functions  are  tools  for  reconstruction  of  mul¬ 
tivariate  functions  from  scattered  data.  This  includes,  for  instance,  re¬ 
construction  of  surfaces  from  large  sets  of  measurements,  and  solving  par¬ 
tial  differential  equations  by  collocation.  The  resulting  very  large  linear 
N  X  N  systems  require  efficient  techniques  for  their  solution,  preferably 
of  0{N)  or  0{N  log  N)  computational  complexity.  This  contribution  de¬ 
scribes  some  special  lines  of  research  towards  this  future  goal.  Theoretical 
results  are  accompanied  by  numerical  examples,  and  various  open  prob¬ 
lems  are  pointed  out. 


§1.  Introduction 

Many  problems  of  numerical  analysis  take  the  form  of  a  generalized  interpo¬ 
lation  in  spaces  of  multivariate  functions  [21].  Due  to  the  Mairhuber-Curtis 
theorem  [12],  such  spaces  cannot  be  fixed  beforehand,  but  must  necessarily 
depend  on  the  given  data.  For  a  plain  multivariate  interpolation  problem  on  a 
finite  set  X  =  {xi, . . . ,  xjv}  of  pairwise  different  points  in  a  domain  H  C 
there  is  an  easy  possibility  to  generate  a  data-dependent  space  via  linear  com¬ 
binations  of  something  that  depends  on  a  free  variable  x  €  C  and  the 
data  locations  Xj ,  namely 


Sx,^  ■=  span  {$(x,Xj)  :  1  <  j  <  N}  (1) 

with  a  fixed  function  $  :  Q  x  H.  The  numerical  generation  of  the  space 
can  be  simplified  considerably  in  the  special  situations 

1)  ^{x,y)  —  0(x  —  y)  with  (j)  :  — >  IR  (translation  invariance), 

2)  ^{x,y)  =  0([|x  -  y||2)  with  (j)  :  [0,oo)  IR  (radiality). 
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and  this  is  how  the  notion  of  a  radial  basis  function  came  up.  To  assure  that 
the  interpolation  in  the  points  of  X  =  {a:i, . . ,  is  uniquely  defined,  the 
matrix 

must  be  nonsingular.  By  definition,  positive  definite  functions  $  even  make 
this  matrix  symmetric  and  positive  definite,  and  the  positive  definite  radial 
functions 

0(r)  =  exp(-r^)  on  for  all  d  (Gaussian), 

0(r)  =  (1  —  r)^(l  +  4r)  on  d  <  3,  see  [25], 

are  typical  examples.  See  the  review  articles  [4,15,18,16]  for  details.  Though 
the  above  functions  are  scalar,  the  positive  definiteness  property  of  the  second 
depends  on  the  dimension  d  of  the  space  containing  x  and  y  when  forming  the 
scalar  argument  r  =  \\x  —  y\\2. 

These  two  examples  already  show  that  the  matrix  in  (2)  can  be 

sparse  or  have  a  strong  off-diagonal  decay.  It  will  be  very  large  if  we  consider 
real-world  problems  with  many  data,  arising  e.g.  from  inverse  engineering  or 
terrain  modelling.  If  the  data  points  are  rather  densely  scattered  over  the 
domain  fl,  the  approximation  power  of  the  space  (1)  will  be  very  good,  but 
the  matrix  in  (2)  will  have  lots  of  similar  rows  and  columns,  yielding  a  bad 
condition  number.  The  connection  between  these  phenomena  is  described  in 
some  detail  in  [17]. 

This  paper  concentrates  on  the  numerical  solution  of  large  symmetric  pos¬ 
itive  definite  systems  with  matrices  of  the  form  (2).  There  are  some  additional 
goals: 

1)  0{N)  complexity  of  solving  the  N  x  N  system, 

2)  0{V)  complexity  of  evaluating  an  element  of  (1)  with  N  terms  and 

3)  getting  away  with  n  «  N  terms  at  a  tolerable  loss  of  accuracy,  when 
interpolating  N  data. 

We  shall  describe  greedy  algorithms  as  recently  studied  by  deVore  [5]  and 
Temlyakov  [24],  but  we  shall  omit  multiple  scales  as  proposed  by  Floater  and 
Iske  [9]  and  continued  later  in  [13]  and  [6].  The  techniques  will  be  partially 
based  on  Krylov  subspaces  as  recently  and  independently  studied  by  Faul  and 
Powell  [8]. 


§2.  Splitting  the  Native  Space  Energy 

Dropping  in  the  notation,  we  can  write  functions  from  (1)  in  the  form 

N 

J  =  1 


(3) 
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with  c  e  X  =  {rci, . , . ,  a:jv}  C  C  ]R^  and  arbitrary  N.  Our  main  tool 
will  be  the  natural  inner-product 


(5c,x> )$  —  I  2/fc)  I  ^  ^Vk) 

\  3  k  )  ^  3  k 

(4) 

introduced  by  a  positive  definite  function  $  on  the  union  of  spaces  (1)  for 
arbitrary  data  sets  X.  We  note  in  passing  that  one  can  form  the  corresponding 
Hilbert  space  closure  to  get  the  native  Hilbert  space  of  but  we  refer  the 
reader  to  [11]  and  [19]  for  details.  If  we  fix  the  set  X  =  {xi, . . .  ,xjv}  and  the 
corresponding  positive  definite  matrix  we  get  an  inner-product 

^^,xd  (5) 

on  which  is  familiar  from  the  theory  of  the  conjugate  gradient  method 
based  on  Krylov  subspaces. 

Note  that  the  inner-product  (4)  is  zero  if  the  function  Sc,x  vanishes  on  Y 
or  if  Sd,Y  vanishes  on  X.  If  we  assume  Y  C  X  and  make  sure  that  Sd,y  agrees 
with  5c, X  on  Y  (e.g.  by  interpolation  on  Y),  then 


{sd,Y,Sc,X  -  Sd,Y)^  —  0 
l|5d,y|||  +  ||5c,X  -  ■Sd,y|||  =  ||5c,x||| 

The  second  identity  can  be  viewed  as  a  split  of  the  energy  of  the  function  Sc,x 
into  the  energy  of  its  interpolant  Sd,y  on  a  subset  Y  of  X  and  the  residual 
5c, X  —  <Sd,y.  We  shall  use  this  energy  split  over  and  over  again. 

§3.  Interpolation  in  Native  Spaces 

At  this  point,  we  digress  a  little  and  study  the  interpolation  of  an  arbitrary 
real- valued  function  f  on  a  domain  0,  C  On  each  fixed  finite  subset 
X  Cfl  we  can  interpolate  the  values  of  /  by  a  function  5/,x  from  (1).  Due  to 
(6),  the  energy  ||s/,x||l>  is  a  monotonic  function  of  X  with  respect  to  addition 
of  points,  and  it  can  be  easily  evaluated  using  (4).  The  energy  is  bounded 
independent  of  /  if  and  only  if  [22]  the  function  /  lies  in  the  native  space  of 
and  in  this  case  we  have  ||/||$  =  sup^  ||>s/,x|k- 
This  observation  has  some  consequences  for  applications.  If  the  user  does 
not  have  any  a-priori  information  on  /,  the  proper  choice  of  $  is  a  problem. 
But  if  the  behaviour  of  ||5/,x|||  with  respect  to  X  is  monitored  for  larger  and 
larger  sets  X,  the  user  can  switch  to  a  less  smooth  $  if  the  energy  values 
grow  dramatically  with  X.  By  (6)  the  behaviour  of  |i5/,x|||  is  related  to  the 
energy  ||/  ~  s/,x|||  of  the  residual,  and  further  study  of  this  as  a  function  of 
X  is  needed,  especially  for  /  with  additional  smoothness  properties.  Current 
starting  points  are  in  [20]  and  [10],  and  readers  are  encouraged  to  proceed 
from  there. 


(6) 
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§4.  Iteration  on  Residuals 

We  now  fix  a  positive  definite  function  and  a  function  /o  from  the  native 
space  of  $  or,  at  least,  from  some  space  of  the  form  (1)  for  a  rather  large  finite 
set  X,  Our  goal  is  to  reconstruct  /o  by  an  iterative  process.  Note  that  this 
solves  a  large  system  with  the  matrix  from  (2),  if  we  start  from  /o  :=  s/,x 
interpolating  some  given  function  /  on  X.  In  this  case,  we  do  not  reconstruct 
/  but  rather  its  interpolant.  In  both  cases  we  need  not  worry  about  the 
existence  of  ||/o||^- 

If  we  pick  some  numerically  manageable  finite  set  Yb  and  interpolate  /o 
on  Yq,  we  can  define  /i  :=  /o  —  and  proceed  iteratively  by 

fj+i  ■=  fj  -  ,  Yj  eft,  j  =  0,1,...  (7) 

using  finite  sets  Yj  that  we  shall  have  to  deal  with  later.  Anyway,  the  energy 
splitting  (6)  yields 

Ik/i.v'jlll  +  Wfj  -  =  II/jIII 

and  by  summation  we  get  the  telescoping  sum 
k  k 

E  lk/„n  111  =  E  Wfi  -  /i+illl  =  WfoWi  -  IIA+illl  <  ll/olll  (9) 

j=o  i=o 

which  must  necessarily  converge  for  fc  — >  oo  even  if  the  choice  of  Yj  is  bad, 
e.g.  Yj  =  Yq  for  all  j.  By  standard  Hilbert  space  argumentation  via  Cauchy 
sequences,  the  functions 

k  k 

^*^+1  ~  ^3+1)  =  fo-  /fc+i 

must  converge  in  the  norm  ||.||^>  to  some  element  g  in  the  native  space,  but 
we  do  not  want  to  use  this  fact.  Our  goal  is  to  prove  that  the  residuals  fk 
converge  to  zero,  and  this  would  imply  that  the  functions  gk  converge  to  /o, 
yielding  the  desired  reconstruction. 

Of  course  we  shall  need  some  additional  assumptions  on  the  sets  Yj  to 
be  successful.  Equations  (8)  and  (9)  suggest  that  we  should  let  take 

up  as  much  energy  from  fj  as  possible,  and  this  will  be  our  guideline  for  the 
convergence  analysis  in  the  following  sections. 

§5.  Conditions  for  Linear  Convergence 

For  simplicity,  let  us  first  assume  that  s/.^y,  picks  up  at  least  a  fixed  percentage 
of  the  energy  of  fj,  i.e. 

H-.v,l||>7ll/illl  (10) 

with  some  fixed  7  G  (0, 1].  This  is  a  disguised  hypothesis  on  the  proper  choice 
of  Yj,  and  we  have  to  prove  later  how  to  satisfy  this  assumption.  From  (8)  and 
(10)  we  conclude  linear  convergence  of  fj  to  zero  via  ||/j+i|||  <  (1  -  7) ||/;  III' 
This  proves 
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Theorem  1.  If  the  choice  of  sets  Yj  satisfies  (10),  the  residual  iteration  (7) 
converges  linearly  in  the  native  space  norm,  and  there  is  an  error  bound 

ll/o- Skill  =  ||A|||<(i-7)'=II/oI||. 

Assumption  (10)  is  not  easy  to  handle,  because  the  norm  involves  $ 
and  the  value  of  the  right-hand  side  is  not  explicitly  known.  But  in  case  of 
/o  ;=  Sf^x  for  some  large  finite  set  X  C  C  we  can  restrict  ourselves  to 
sets  Yj  C  X,  and  all  functions  fj  will  stay  in  the  finite-dimensional  space  (1). 
On  this  space,  we  can  pick  any  norm  ||.||x,  for  instance  any  discrete  Lp  norm 
of  functions  on  X,  and  make  use  of  the  norm  equivalence 

c||5||x  <  ||s|k  ^  (11) 

for  all  functions  s  from  the  space  (1),  where  the  constants  satisfy  0  <  c  <  C. 
Then  we  can  try  to  get  away  with 

\H.YAx>mj\\i  (12) 

with  some  S  €  (0, 1]  instead  of  (10).  But  since  this  equation  implies  (10)  with 
7  >  >  0,  we  get 

Theorem  2.  If  the  choice  of  sets  Yj  satisfies  (12)  for  some  norm  ||.||x  of 
functions  on  X,  the  residual  iteration  (7)  for  reconstruction  of  fo  :=  Sf^x 
converges  linearly  in  j|.|jx  and  there  is  an  error  bound 

11/  -  9k\\x  <  -  (^1  -  ll/IU- 


§6.  Maximizing  Energy  of  Interpolants 

Our  argument  at  the  end  of  Section  4  leads  to  the  problem  of  finding  a  finite 
set  Y  such  that  the  energy  ||5/,y|||  of  the  interpolant  of  some  function  (or 
residual)  /  is  large.  If  fy  ^  is  the  vector  of  values  of  /  on  T,  the 

interpolant  s/,y  solves  a  system  with  a  matrix  A^^y  defined  as  in  (2),  and  the 
energy  is  given  by  the  quadratic  form 

lk/,y|||  =  /y>l$,y~^/y  >  ||/y||2'^mm  (A$,y“^)  =  \\fy\\l/Xmax  (A$,y) 

as  a  function  of  /  and  Y.  The  maximal  eigenvalue  Xmax  (A$,y)  is  hard  to 
discuss  in  general  (see  Narcowich  and  Ward  [14]  for  results),  and  we  simply 
view  this  quantity  as  a  factor  that  depends  on  the  geometry  of  Y  and  the 
number  |T|  of  points  in  T.  It  is  an  interesting  open  problem  to  design  some 
Remes-type  algorithm  based  on  exchanges  of  points  to  arrive  at  the  best  choice 
of  a  set  Y  with  a  prescribed  number  of  points. 
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In  the  special  case  jyi  =  1,  F  =  {?/}  things  are  easy.  We  get 

lls/.rlll  =  f{y?Hy^y)  > 

and  the  maximum  of  will  be  the  best  choice,  especially  if  is  translation- 
invariant  or  radial. 

If  we  have  Y  C  X  for  a  large  finite  set  X,  we  can  invoke  Courant’s 
minimum-maximum  principle  to  get  Xmax  ^  Xmax  as  an  upper 

bound  that  does  not  depend  on  the  choice  of  Y.  A  reasonable  strategy  for 
maximizing  ||5/,y|||  then  is  to  pick  the  |y|  points  of  X  where  /  takes  its 
largest  absolute  values. 

In  the  general  situation,  we  have  to  face  the  fact  that  coalescing  points 
are  not  allowed.  A  reasonable  strategy  is  to  mimic  the  previous  situation,  i.e. 
to  take  some  large  set  X  of  well-distributed  points  and  pick  the  points  of  X 
where  f  is  largest  in  absolute  value. 

Another  possible  strategy  is  the  iterative  greedy  collection  of  more  and 
more  points,  forming  a  recursive  Cholesky  factorization.  Since  this  possibility 
does  not  seem  to  be  familiar  to  researchers  in  this  area,  we  outline  the  process 
here.  Assume  that  an  interpolant  to  /  on  K  is  available  together  with  the 
inverse  B  of  We  now  want  to  add  another  point  z  G  H  \  V  to  y, 

thus  enlarging  the  energy  of  the  interpolant.  A  naive  choice  of  z  is  via  the 
maximum  of  the  absolute  value  of  /  —  s /,y ,  but  since  we  have 

y€y 

the  best  choice  of  z  for  fixed  Y  is  obtained  by  maximizing  the  right-hand  side 
of  this  equation.  Having  found  z,  one  has  to  update  B  in  a  suitable  way. 
First,  calculate  the  vector  v  G  with  components  $(z,  y),  y  ^Y  and  form 
w  :=  Bv.  The  number 

1/a  :=  $(z,  z)  -  v'^w  =  4>(z,  z)  ~  v^Bv 

can  be  shown  to  be  positive,  because  $  is  positive  definite  and  z  does  not 
belong  to  y.  Then  form  u  :=  —aw  and  C  B  vXu/a.  The  matrix 

C  u 

T 

u  a 

then  is  the  inverse  of  yu{£}  needed  for  the  interpolation  on  y  U  {z}.  Un¬ 
fortunately,  there  is  no  numerical  experience  in  this  direction  so  far,  especially 
for  the  maximization  of  (13).  A  more  careful  calculation  of  the  numerical  com¬ 
plexity  reveals  that  we  have  nothing  else  here  than  a  special  formulation  of 
the  partial  Cholesky  algorithm  with  pivoting.  The  choice  of  pivots,  however, 
is  adapted  to  the  setup  of  the  problem  as  an  interpolation. 

Altogether,  this  section  was  intended  to  motivate  readers  to  look  at  the 
problem  of  finding  good  finite  sets  Y  for  improvement  of  interpolants. 
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§7.  A  Simple  Greedy  Algorithm 

Among  other  things,  the  previous  section  showed  how  to  work  on  subsets 
Y  consisting  of  a  single  point  y  each.  The  best  possible  choice  is  to  take 
the  point  where  /  takes  its  maximum  absolute  value,  and  the  interpolant  is 
^f,{y}  —  fiy)^{y^  *)/^(2/j  y)-  We  now  do  this  iteratively  in  the  sense  of  (7)  by 
picking  Yj  :=  {yj}  with  \fj{yj)\  =  ||/j||c»-  In  the  “discrete”  case  /o  :=  s/.x  we 
take  the  Chebyshev  norm  on  X,  while  in  the  “continuous”  case  fo  '•=  f  G  (7(f2) 
with  n  being  a  compact  subset  of  we  take  the  Chebyshev  norm  on  O. 

Due  to  Theorem  2,  the  discrete  case  leads  to  linear  convergence  towards 
fo  :=  5/,x,  because  (12)  is  satisfied  with  <5  =  1.  From  a  theoretical  viewpoint, 
this  is  much  better  than  the  non-quantitative  convergence  result  of  Faul  and 
Powell  in  [7].  On  the  other  hand,  there  always  is  the  conjugate  gradient 
method  as  a  competitor,  and  it  has  linear  convergence,  too.  But  it  needs  to 
form  matrix-vector  products,  while  our  greedy  algorithm  does  not  even  store 
the  matrix.  It  simply  needs  two  arrays  of  length  |X|  for  the  residuals  and  the 
coefficients,  and  in  each  cycle  it  updates  one  cofficient  and  runs  once  over  the 
residuals  to  update  them  and  find  the  maximum  for  the  next  iteration.  This 
extreme  numerical  simplicity  must  come  at  a  price,  and  the  price  is  very  slow 
convergence  after  some  good  progress  in  the  first  few  iterations.  We  report 
on  numerical  experiments  and  adaptive  multiscale  improvements  in  [23],  but 
at  this  point  we  want  to  direct  the  reader’s  attention  to  extend  the  above 
strategy,  e.g.  via  some  suitable  preconditioning. 

Before  we  look  more  closely  at  the  greedy  algorithm  in  the  discrete  case, 
let  us  digress  a  little  into  the  continuous  case. 

Theorem  3.  If  $  is  a  continuous  translation-invariant  positive  deBnite  func¬ 
tion  on  a  compact  domain  fl  the  greedy  algorithm  for  interpolation  of 

a  function  f  from  the  native  space  of  $  converges  uniformly. 

Proof:  We  have 


and  (9)  shows  that  the  quantities  ||/j||^  are  summable.  Consequently,  the 
residuals  fj  converge  uniformly  to  zero  on  the  compact  set  H.  □ 

Corollary  4.  Under  the  assumptions  on  $  as  in  Theorem  3,  the  native  space 
norm  is  expressible  via  a  series 

oo  oo 

ii/iil  =  ^(o)E/'fe)  =  ^(o)  Ell/ill- 

j=0  j=0 


where 


fo  :=  /,  \fj{yj)\  =  ll/illoo,  fj+i  :=  fj  -  fj{yj)(f>{-  -  2/j)M0). 


This  result  may  look  complicated  at  first  sight,  but  it  should  be  compared 
to  other  definitions  of  the  native  space  norm,  e.g.  via  Fourier  transforms,  by 


366 


R.  Schaback  and  H.  Wendland 


abstract  completion  of  a  pre-Hilbert  space,  or  by  the  supremum  of  the  action 
of  certain  functionals. 

We  do  not  want  to  go  into  details  here  (see  [11],  for  instance),  but  prefer 
to  give  an  illustrative  example.  If  specialized  to  Sobolev  space  W2  (fi)  with 
k  >  (i/2,  one  has  to  take  0(a:)  =  ||rr]|2  ^^^A';t_d/2(||a:||2)  in  order  to  recover 
Sobolev  space  as  a  native  space  for  ^{x,y)  =  0(||x  -  y\\2)-  Now  by  Corollary 
4  one  gets  the  Sobolev  norm  of  a  function  as  a  series  containing  just  function 
values  on  the  domain  in  a  numerically  accessible  way,  using  neither  derivatives 
nor  integration  (but,  of  course,  maximization).  It  should  be  pointed  out  that 
this  technique  provides  some  means  to  assess  the  Sobolev  smoothness  of  a 
given  function  numerically.  Readers  are  encouraged  to  proceed  from  here. 


§8.  Dual  Techniques 

Another  possible  approach  to  solving  a  large  N  x  N  system  with  a  large 
symmetric  and  positive  definite  coefficient  matrix  via  smaller  finite  sub¬ 
problems  is  to  define  certain  finite-dimensional  subspaces  Sj  of  the  native 
space  and  to  approximate  the  exact  solution  on  A  =  {0:1, . . , ,  a:;v}  by  approx¬ 
imation  in  the  native  space  norm.  More  precisely,  the  iteration  starts  like  in 
Section  4  with  some  function  /o  and  j  :=  0,  and  iterates  like  (7)  according  to 


ll/j  -Sjik  :=  inf  ll/j  -s|k 

s^Sj 

fj  +  1  •=  fj  —  Sj. 


(14) 


By  standard  arguments,  this  iteration  also  satisfies  (8)  and  the  rest  of  Section 
4,  including  the  summability  condition  (9).  Note  that  if  a  space  Sj  =  Syj,^ 
has  the  form  (1)  for  some  finite  set  V},  then  the  best  approximation  solution 
Sj  in  (14)  coincides  with  and  we  are  back  to  the  method  in  Section  4. 

This  observation  follows  from  Theorem  7  in  Section  9. 

But  there  are  other  possible  choices  for  the  spaces  Sj.  In  particular,  Paul 
and  Powell  [7]  pick  certain  one-dimensional  spaces  Sj  ~  span{wj}  for  all  j  > 
0.  Then  Sj  :=  ajUj  with  aj  :=  {uj,  fj)^/{uj,Uj)^  solves  the  approximation 
problem,  and  we  have  the  summability  condition 


k 


j=o 


=  EkiiI“i  =  E( 


3=0 


3=0 


(15) 


§9.  Cyclic  and  Greedy  Dual  Strategies 

In  [7],  Paul  and  Powell  fix  N  such  functions  Uj  by  a  certain  precalculation  that 
we  shall  discuss  later.  These  functions  are  used  periodically,  i.e.  Uj  is  used  in 
step  j  -f  kN  for  all  k  >  0.  The  periodic  reuse  has  the  advantage  that  one  can 
precalculate  and  store  the  Uj,  if  their  construction  is  somewhat  involved.  We 
start  with  a  generalized  and  simplified  version  of  the  convergence  result  in  [7]: 
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Theorem  5.  If  fo  is  in  the  span  of  the  functions  Uj  for  1  <  j  <  iV,  then  the 
cyclic  dual  method  of  Faul  and  Powell  converges  to  fo- 

Proof:  Since  everything  takes  place  in  a  finite-dimensional  space,  and  since 
the  technique  involves  an  energy  split,  the  functions  Qj  =  fo  —  fj  converge  to 
some  function  g  in  the  span  of  the  uj,  and  the  fj  converge  to  fo  —  g-  But  as 
(15)  implies 

lim  {jr^Jj+kN)^  =  0  =  9)^ 

k—*oo  ||Ujl|$ 

for  all  j,  the  functions  fo  and  g  must  coincide.  □ 

There  are  lots  of  choices  of  uj  that  satisfy  the  hypothesis  of  Theorem  5. 
Conjugate  directions  and  uj  :=  would  do  the  job.  The  latter  strategy 

coincides  with  the  greedy  method,  if  the  cyclic  choice  is  given  up  in  favour  of 
picking  the  point  where  the  residual  is  maximal  in  absolute  value.  A  linear 
convergence  result  is  possible,  if  such  a  modification  is  made  in  general: 

Theorem  6.  If  fo  is  in  the  span  of  the  functions  uj  for  1  <  j  <  N,  then  the 
iteration  (14)  with 

«=  (16; 

Sj  :=  span  {ukj } 

converges  linearly  to  fo- 

Proof:  We  can  proceed  as  in  Section  5,  using 

ll^llx  :=max{u,Uj)% 


for  all  functions  u  in  the  span  U  of  all  uj.  The  assumption  (12)  is  satisfied  for 
Sj  instead  of  Sf.^Yj  due  to 


{fji'^kj  )^ 

Sj  —  -J  r  Uk^ 

(Ukj.Uk^)^ 

Ikillx  ^  {^3^'^kj)%  =  (/j»Wfej)|  = 


and  the  rest  follows  easily.  □ 

The  inner-products  in  (16)  can  be  evaluated  explicitly,  if  we  work  in  the 
space  (1)  and  use  (4)  and  (5)  in  the  form 


(5c.x,Srf,x)$  =  'Y^CkSd,x{xk)-  (l*^) 

k 

This  is  particularly  efficient  if  the  functions  Uj  have  only  a  small  number  of 
nonzero  coefficients  in  their  representation  of  the  form  Uj  =  Another 

possibility,  exploiting  the  dual  nature  of  the  algorithm,  is  to  store  and  update 
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the  inner-products  (/j, ujt)$  instead  of  the  values  fj{xk)-  So  far,  there  is  no 
numerical  experience  with  dual  greedy  algorithms,  unfortunately. 

One  has  a  lot  of  leeway  for  picking  suitable  functions  uj,  especially  when 
preconditioning  arguments  come  into  play.  Faul  and  Powell  use  local  Lagrange 
functions  uj  based  on  relatively  small  subsets  Yj  of  X  that  contain  Xj.  In 
particular,  Uj  £  Syj,^  is  defined  by  the  interpolation  conditions 

Uj{Xj)  =  1, 

Uj{xk)  =  0  for  all  Xfc  G  Ij,  k  ^  j, 
and  is  expressible  in  the  form  Uj  =  with  at  most  \Yj\  nonzero 

coefficients.  The  precalculation  involves  the  solution  of  N  systems  with  \Yj  \  x 
\Yj  \  matrices  ,and  it  can  be  kept  at  0{N)^  if  the  values  \Yj  \  are  bounded 
independent  of  N.  Our  arguments  in  Section  10  will  show  how  this  technique 
can  be  interpreted  as  preconditioning  the  matrix  A^^x-  For  a  fixed  accuracy 
to  be  obtained,  and  for  their  special  choice  of  the  sets  Yj,  Faul  and  Powell 
then  observe  that  they  need  only  a  small  fixed  number  of  cycles  of  the  dual 
algorithm.  Each  cycle  has  N  one-dimensional  subproblems,  but  there  are 
techniques  to  keep  each  subproblem  at  a  reasonable  complexity,  provided  that 
techniques  like  multipole  expansions  [1]  or  compactly  supported  radial  basis 
functions  [26,  25]  are  used. 

The  selection  of  functions  uj  is  particularly  good  if  there  are  orthogonality 
or  conjugacy  relations  among  them.  Let  us  look  at  an  inner-product  {uj,Uk)^ 
in  case  of  (18),  using  (17)  and  uj  =  We  get 

(Uj,Uk)^=  ^  c^^Uk{x^), 

and  this  quantity  vanishes  if  Yj  CYk\ 

This  can  be  seen  as  a  motivation  for  choosing 

Xj  eYj  C  {xj,Xj+i,...,XN}  (19) 

as  done  by  Faul  and  Powell.  Even  if  the  functions  Uj  are  in  general  not  mutu¬ 
ally  orthogonal  they  are  at  least  linear  independent  as  needed  for  Theorems 
5  and  6.  To  see  this  note  that  the  matrix  C'  =  (c^)  which  describes  the  tran¬ 
sition  from  the  basis  ($(-,a::j),l  <  j  <  N)  to  {uj,l  <  j  <  N)  is  an  upper 
triangle  matrix  and  thus  invertible  if  cj  ^  0  for  1  <  i  <  N.  This  is  indeed  the 
case  because  of 

0  5^ll“Illl=  Y  =ci. 

m-.XmE^yj 

We  finish  this  section  by  pointing  out  how  to  make  optimal  use  of  solving 
N  systems  with  |>j|  x  \Yj\  matrices  A^^Yj  for  subsets  Yj  in  a  precalculation. 
If  the  full  inverses  of  the  A^^Yj  are  stored  instead  of  the  coefficients  of  Uj, 
one  can  use  the  cyclic  dual  algorithm  with  Sj  :=  Sy^,^-  The  energy  split  at 
each  step  of  the  algorithm  will  then  be  better  or  equal  to  the  split  obtained 
by  the  dual  cyclic  algorithm  using  a  single  Uj  €  like  the  one  used  by 

Faul  and  Powell.  This  is  clear  from  (14),  and  the  following  theorem,  which 
is  well  known  since  the  advent  of  splines,  shows  that  we  end  up  with  a  cyclic 
interpolatory  method  of  the  form  (7). 
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inf  llZ-sll* 

sGoy,* 

for  any  f  in  the  native  space  of  $  is  solved  by  the  interpolant  Sf^y 

Proof:  Equations  (6)  generalize  via  continuous  transition  to  the  Hilbert  space 
completion  to 

-  s/,y)^  =  0 

lk/,y|||  +  11/  -  -s/.ylll  =  ll/lll 

for  all  /  in  the  native  space,  and  the  assertion  follows.  □ 

Consequently,  algorithms  using  interpolants  on  finite  subsets  make  opti¬ 
mal  use  of  the  information  contained  in  the  space  This  links  the  dual 

techniques  back  to  the  interpolatory  methods  in  Section  4.  Numerical  results 
concerning  the  above  cyclic  interpolatory  method,  e.g.  using  the  sets  Yj  of 
Faul  and  Powell,  are  still  missing.  The  progress  must  be  better  due  to  The¬ 
orem  7,  but  at  the  expense  of  much  more  storage.  And,  an  incorporation 
of  greedy  selections  using  the  good  preconditioning  power  of  the  Faul-Powell 
approach  seems  worth  investigating. 

§10.  Quasi-Interpolation 

There  is  a  hidden  link  between  the  Faul-Powell  technique,  preconditioning  of 
and  certain  quasi-interpolation  methods  using  local  Lagrange  functions, 
as  investigated  by  Beatson,  Powell  and  their  coworkers  (see  for  example  [2]). 
If  we  write  the  interpolant  to  some  function  /  in  Lagrange  representation 

N 

Sf,x  (20) 

j-i 

with  N  Lagrange  basis  functions  Vk  E  Sx,^  satisfying  Vj{xk)  =  we  can 
relax  (20)  to  a  quasi-interpolation  formula 

N 

Sf,u,x  :=  Y^fixj)uj  (21) 

j=i 

for  any  other  choice  of  functions  uj  that  approximate  the  global  Lagrange 
basis  functions  Vj.  The  choice  (18)  for  certain  subsets  Yj  is  quite  natural, 
because  one  can  often  [3]  observe  that  local  Lagrange  functions  based  on  a  set 
Yj  of  neighbouring  points  to  Xj  G  Yj  decay  quickly  away  from  Xj.  Assuming 
(18)  (but  not  (19))  from  now  on,  the  representation  (21)  can  be  rewritten  in 
terms  of  Uj  =  Sci,x  (3)  as 

j=l  k-.xkeYj  k=l  jiXkeYj 
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The  coefficients  of  the  second  representation  can  be  evaluated  locally,  and 
the  computational  advantage  is  particularly  evident  in  case  of  compactly  sup¬ 
ported  radial  basis  functions. 

We  now  want  to  look  at  the  quality  of  such  quasi-interpolants  on  the 
discrete  set  X  itself.  The  operator  that  maps  the  vector 

/|^  :=  (/(xi),...,/(x^,)y  6]R~ 

to  s/,tx,X|^  €  can  be  written  as  the  matrix  product  ■  C,  where 

C  =  (cj.)  is  the  nonsymmetric  N  x  N  matrix  with  row  index  k  and  column 
index  j  containing  the  coefficients  cj.  of  the  Uj  columnwise.  The  operator  that 
generates  the  residuals  on  X  then  is  En  —  •  C  with  the  N  x  N  identity 

matrix  Ej^.  In  case  of  =  X  for  all  j  we  have  C  =  and  there 

are  good  reasons  to  expect  that  there  are  numerically  interesting  cases  where 
some  matrix  norm  of  E^  —  •  C  is  smaller  than  one.  In  such  cases  one 

can  solve  the  problem  on  X  by  successive  quasi-interpolation  via  a  Neumann 
series.  In  terms  of  vectors  and  containing  the  values  of  residuals  fj  and 
quasi-interpolants  to  fj  on  X,  we  have  the  linearly  convergent  iteration 

f  :=/ou 

y  :=  A^,x-Cf 
/J+i  :=  fi  -s>  =  (En  -  Ai,x  ■ 

calculating  the  interpolant  to  the  data  of  /o  on  X  as  the  sum  over  the  .  Note 
that  we  cannot  use  the  energy  split  here,  because  we  have  left  the  context  of 
interpolation  and  approximation.  Note  further  that  C  acts  as  a  (nonsymmet¬ 
ric!)  preconditioner  or  an  approximate  inverse  to 

§11.  Experiments  Concerning  Quasi-Interpolation 

To  calculate  the  norm  of  Eat  —  Aip^x  •  C  numerically,  we  observe  that  the 
matrix  A^^x  ■  C  has  the  entries  where  i  is  the  row  index.  Thus  the 

entry  at  (i,  j)  of  E^  —  A^^x  •  C  vanishes  for  Xi  G  Yj,  and  the  column-sum 
norm  of  Ejx  —  A^^x  •  C  can  be  written  as 

max  ^2  (22) 

Again  it  turns  out  that  the  decay  of  local  Lagrange  functions  is  essential. 

In  case  of  data  on  the  uniform  grid  {hTZ)'^,  a  radial  basis  function  4)c 
with  support  in  [0,c],  and  sets  Yj  :=  {y  G  (hZZ)^  :  \\xj  —  y\\2  <  R}  of 
neighbours  to  Xj  within  a  radius  R,  the  norm  in  (22)  can  be  evaluated  by 
looking  at  the  local  Lagrange  function  uq  with  respect  to  the  origin  and  the 
set  Yb  :=  {y  G  (/lE)^  :  j|2/[|2  <  i?}  of  local  interpolation  points.  Since  both 
Yq  and  the  support  of  (/>  are  bounded,  the  function  uq  is  zero  on  integer  grid 
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Fig.  1.  <7  =  4,  i?  =  6,  norm  =  0.48,  and  C  =  5,  i?  =  8,  norm  =0.29. 


N 

1  5 

9 

13 

21 

25 

29 

37 

45 

49 

57 

61 

Afo.9 

5 

5 

5 

9 

29 

49 

69 

81 

97 

145 

145 

Afo.i 

5 

5 

21 

29 

109 

137 

149 

Tab.  1.  Point  numbers  Mp  required  for  norm  <  p  and  N  points  in  support  of  <j). 


points  outside  the  disk  around  zero  with  radius  i?  +  c.  Omitting  the  value 
1  at  the  origin  for  scaling  reasons,  Figure  1  shows  the  behaviour  of  uq  on 
integer  gridpoints  around  the  origin.  We  picked  two  cases  for  the  function 
(j)c{r)  :=  (1  —  r/c)^(l  +  4r/c)  from  [25]  where  the  norm  of  En  —  •  c  is 

smaller  than  one,  and  the  corresponding  numbers  of  local  interpolation  points 
in  Yo  are  113  and  197,  respectively. 

For  applications,  it  is  necessary  to  know  how  large  R  must  be  for  fixed 
c  and  h  in  order  to  make  the  norm  of  Ejsf  —  •  c  smaller  than  0.9  or 

0.1,  say.  Since  R  and  c  scale  with  /i,  the  numbers  M  and  N  of  points  in  Yq 
and  the  support  of  (f)  depend  on  R/h  and  C/h,  respectively.  Given  a  support 
radius  c  and  a  maximal  meshwidth  h  such  that  the  support  of  contains 
N  =  1, 5, 9, 13, . . .  points,  we  provide  in  Table  1  the  minimal  number  Mp  of 
points  in  Yq  that  are  necessary  to  keep  the  norm  of  J5jv  —  A^^x  ’  c  below  p. 
Another  way  of  reading  Table  1  is  that  if  the  matrix  A^^x  for  interpolation 
by  ^{x,y)  :=  ^c(||a^  —  2/II2)  on  a  regular  grid  has  bandwidth  N,  then  it  has 
an  approximate  inverse  with  bandwidth  Mp  that  leads  to  a  residual  matrix 
of  norm  p.  The  quasi-interpolant  is  to  be  calculated  via  subproblems  with 
Mp  X  Mp  matrices.  It  is  an  interesting  challenge  to  provide  sparse  approximate 
inverses  for  sparse  symmetric  positive  definite  matrices,  because  normally  the 
exact  inverses  will  not  be  sparse. 
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§12.  Conclusions 

At  first  sight,  our  results  on  linear  convergence  look  promising,  but  they  still 
are  too  weak  to  provide  a  convergence  rate  that  is  independent  of  Ny  since 
no  preconditioning  techniques  are  involved.  Improvements  should  thus  focus 
on  preconditioning,  e.g.  along  the  lines  of  Paul  and  Powell.  Greedy  methods 
for  fixed  $  are  limited  to  quick-and-dirty  approximations  with  few  nonzero 
coefficients  and  need  extension  to  multiscale  techniques.  The  adaptive  greedy 
method  in  [23]  is  a  first  step,  but  the  results  shown  there  imply  that  it  has  to  be 
stopped  before  it  runs  into  scales  that  are  too  small.  A  possible  continuation  at 
small  scales  is  provided  by  quasi- interpolation  as  outlined  here.  A  combination 
of  both  techniques  generates  approximations  which  consist  first  oi  K  «  N 
global  terms  obtained  by  an  adaptive  greedy  method,  followed  by  N  local 
terms  constructed  by  quasi-interpolation.  The  overall  complexity  can  thus  be 
kept  at  0{N), 
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Parametric  Polynomial  Curves  of 
Local  Approximation  of  Order  8 

K.  Scherer 


Abstract.  Parametric  approximation  of  curves  offers  the  possibility  of 
increasing  the  order  of  approximation  by  using  the  additional  parameters 
in  the  parametrization  of  the  curve.  This  has  been  studied  in  several 
papers,  see  e.g.[l-8].  The  resulting  problems  are  highly  nonlinear.  Here 
the  cases  of  approximation  order  0{h^)  are  studied  which  need  piecewise 
quartic  curves  in  the  plane  and  piecewise  quintic  curves  in  space. 


§1.  Introduction 

A  general  conjecture  concerning  the  local  approximation  order  by  polynomial 
curves  can  be  formulated  as  follows  (see  e.g.  M0rken- Scherer  [6]): 

•  For  sufficiently  small  h  >  0  and  a  sufficiently  smooth  curve  f{t)  :  t  G 
[0,  /i]  — >■  /(t)  G  there  exists  a  polynomial  curve  p{t)  of  degree  n  and 
a  reparametrization  99  of  /  on  [a,  6]  such  that 

sup  ||(/0VJ)(t)  -p(0ll  <  C{f)h"'  , 

0<t<h  —  -  — 

where  m  :=  n  +  1  +  [^zyl- 

The  increased  order  m  is  explained  in  [6]  by  the  principle  of  degree  reducing. 
It  comes  from  the  idea  of  approximating  with  an  interpolating  polynomial 
curve  p{t)  of  degree  m  -  1  such  that 

p(ti)  =  (/o(/>)(ti),  l<i<m,  (1) 

for  points  ti  in  [0,/i]  (multiplicities  allowed).  The  additional  parameters  oc- 
curing  via  the  reparametrization  0  are  used  to  reduce  the  degree  m  —  1  of  p{t) 
by  requiring 

[tl,  .  .  .  ,  tm-i]{p)  =  [tl,  .  .  .  ,  0(1))  =0,  i  =  0,  .  .  .  ,  /C  -  1,  (2) 
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with  k  as  large  as  possible.  Since  we  have  to  normalize  (j)  such  that  0(0)  = 
0, 0(/i)  =  h,  there  are  m  —  2  parameters  left  at  our  disposal  for  this  goal.  Thus 
m  — 1  can  be  reduced  to  n  =  m— 1  — fc,  where  k’d  <  m  —  2ovk  <  {n  —  \)/{d~\). 
From  classical  approximation  theory  it  is  then  clear  that  a  solution  of 

(1) ,(2)  yields  a  polynomial  curve  p{t)  of  degree  m  —  I  —  k  satisfying 

sup  ||(/o«^)(0-p(i)ll<'>"‘  sup  ||Z)"’(/o^)(t)||.  (3) 

Q<t<h  0<t<h 

Thus  conjecture  (•)  is  true  if  one  can  guarantee  in  addition  that  the  parame¬ 
ters  of  a  solution  of  (2)  remain  bounded  for  /i  — >  0.  This 

question  of  stability  is  also  discussed  in  [6].  Note  also  that  in  this  case  equa¬ 
tions  (1)  can  be  written  as 

p  o  0"^(si)  =  1  <  z  <  m, 

where  the  nodes  Si  are  defined  by  Sj  := 

The  most  interesting  case  of  the  conjecture  is  when  (n  -  l)/{d  -  1)  is  an 
integer  k,  i.e.  m  =  n+  l-hfc  =  fcd-f-2.  In  this  case  there  are  kd  equations  in 

(2) ,  and  the  degree  m  —  1  is  reduced  by  k.  Then  0  is  determined  by 

[ti,...,t„+i+i](/o0)  =  0,  2  =  (4) 

and  p{t)  by  the  first  n  -f  1  equations  in  (1). 

So  far  the  conjecture  seems  to  be  proved  only  in  the  case  n  =  d  or  k  =  1 
(see  [5-6])  and  for  A;  =  2  =  d  (see  [1,3]).  Here  we  treat  the  next  most  difficult 
cases  fc  =  3,d  =  2  and  fc  =  2,d  =  3,  which  amount  to  six  equations  in  (4), 
and  will  lead  to  quartic  curves  in  the  plane  and  quintic  curves  in  space  with 
approximation  rates  of  order  8,  respectively. 


§2.  Reduction  to  2  x  2  Systems 

In  Morken  -Scherer  [6]  equations  (4)  were  studied  in  particular  for  the  Taylor 
case 

£)"+‘(/o0)  =  O,  i  =  l,...,fc. 

The  crucial  point  is  then  the  formula  of  Faa  di  Bruno  in  the  form  given  by 
T.  Goodman.  It  reads  (cf.  [4]) 


where 


aij  = 


•••A, 


(5) 


/  1  _  _ 

■  Ijlmil  •  ■  ■  Tnr\ 


and 
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Here  the  integer  r  denotes  the  number  of  distinct  integers  among  the  /i, . . . ,  /j, 
where  mi, . . .  ,mr  are  the  multiplicities  of  them.  Specific  examples  are 


a/,1  =  A,  0,1^2 


«/,/  =  Pi- 


At  first  we  consider  the  planar  case  d  =  2  for  fc  =  3.  Then  the  equations  (4) 
specialize  to 

D^(/o0)(O)  =  O,  i  =  5,6,7.  (6) 

Under  the  normalization  ^(0)  =  0,  pi  :=  =  1,  this  yields  six  equations 

for  the  unknowns  p2,  In  [8]  this  system  has  been  reduced  to  a  2  x  2 

system  for  P2,p3-  The  basic  idea  was  to  simplify  system  (6)  by  determining  a 
preliminary  reparametrization  if)  with  '0(0)  =  0,  '0'(O)  =  1  such  that 

I|2‘+l(/oV')(0)  =  0,  i  =  l,2,3. 

It  can  easily  be  shown  that  this  system  is  uniquely  solvable  in  the  unknowns 
72j-”j77j  where  jj  :=  provided 

span(/'(0),/"(0))  =  lR".  (7) 

Thus  one  can  assume  that  T>^/(0)  =  £>^/(0)  =  in  (6).  Denoting  dij 

as  the  cross  product  of  and  in  i.e. 

dij  -  f}''>  X 

Further  straightforward  computation  (cf.  [9]  )  leads  to 

Lemma  1.  Under  the  assumption  (7)  and  the  normalization  0(0)  =  0,  pi  := 
0'(O)  =  1,  the  vector  ---iPi  is  a  solution  of  the  equations  (7)  iff  p2yP3  is  a 
solution  of  the  2x2  system 


0  =  dg.l  +  20^4,1^3  +  10^2, i/?3 

+  60/32^4^2  +  15(^4^!  —  2d2,l/?3)^2) 


(8) 


and 


0  =  15/52^4,1  +  75/32^4,2  +  (Sdg^i  —  lOd^  i/d2,i)/?2  +  dQ^2 

+  10(2^4,2  +  p2di,l)p3  -  10d2,lp2P3- 

The  elimination  of  the  variable  p^  in  this  system  along  the  lines  of  the 
resultant  method  yields  an  equation  of  degree  9  in  /32-  However,  it  was  over¬ 
looked  in  [8]  that  the  constant  term  in  this  equation  vanishes,  so  that  it  is  in 
essence  of  degree  8  .  Thus,  existence  of  a  (real)  solution  of  (8)-(9)  cannot  be 
derived  in  this  way.  We  will  close  this  gap  in  the  next  section. 
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In  the  space  case  =  3  we  have  to  find  a  reparametrization  (j)  such  that 

D^(/o(/))(0)  =  0,  i  =  6,7.  (10) 

We  simplify  this  again  by  determining  a  reparametrization  ^  with  'ip{0)  =  0, 
V^'(O)  =  1,  :=  ij  and 


^'(/o^)(0)=0,  j-4,7. 

This  is  possible  since  by  (5),  in  the  three  equations  forming  o  '0)(O)  =  0 
the  coefficients  of  and  f”  are  linear  in  the  unknowns  72,73,74,  and  in 

the  latter  three  are  linear  with  respect  to  the  75,  76,  77-  Therefore,  these 
equations  are  uniquely  solvable  under  the  assumption 

span(/'(0),/"(0),/'"(0))  =  ]R^  (11) 

and  we  can  consider  (10)  without  loss  under  the  assumption  Z?''(/)(0)  = 
D^[f){0)  =  0.  This  yields  (with  /J,  =  1)  the  equations 

0  =  06^  +  (Oft  +  loftft  + 10^3)/"  , 

+  (15ft  +  OOftft  +  15/?!)/'"  +  15ft/"^'  +  f}^\ 


0  =  ft/'  +  (7ft  +  21ftft  +  35ftft)/"  +  [Wh0l  +  35ft)/<®> 

+  (21ft  +  lOSftft  +  7001  +  105/3|ft)/"'  +  21ft/'®’. 

The  next  step  is  to  take  in  (12)  the  scalar  product  with  cross  products  /'  x 
X  and  f'  X  respectively.  We  obtain  the  equivalent  equations 

0  =  (15/?4  +  60/?2/?3  +  15)52)^1,2,3  +  15/32C?5,1,2  +  ^6,1^2  (14) 

0  =  (6/?5  +  I5P2P4  +  10/51)^2,1,3  +  15/?2C^5,1,3  +  ^6,1,3  (15) 

O  =  /56di,2,3  +  15/?2d5,2,3  +  ^6,2,3i  (10) 

where 

(;(h)  x/(y,/0))  ;=  det(/<0), /(•=), /(^’)  ;= 

These  equations  serve  for  eliminating  the  variables  (34, Ps  and  /^e  since  they 
appear  linearly.  Before  doing  this,  we  transform  (13)  into  three  equivalent 
scalar  equations  analogously  to  (12).  We  obtain  the  three  equations 

0  =  /37dl,2,3  +  (105/3|  4-  35/53)^5,2,3  +  21/12^6,2,3,  (l’^) 

0  =  (/36  +  Sp2p5  +  5/13)54)^2,1,3  +  (1552  +  553)c?5,l,3  +  352^^6,1,3,  (18) 

0  =  (355  +  155254  +  105|  +  155i53)^^3,l,2  +  (155|  +  553)c^5,l,2  +  352cl6,l,2.  (19) 
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Equation  (17)  determines  directly  in  terms  of  j32,l33.  Now  we  eliminate 
/95,/?6  in  (18)  via  (15), (16).  This  gives  at  first 

0  =  -(3/?2/35  +  5^53/34)^2,1,3  +  (15/3|  +  5/33)d5,i,3  +  3/32(d6,l,3  +  5d5^2,3)  +  C^6,2,3 
and  then 

15  5 

0  =  ■y/?2^1,2,3  +  5/53^5,1^3  +  -/?2(<^6,1,3  +  6^5, 2, 3)  +  C^6,2,3 
+  (™/32/?4  +  5/?3/?2)di,2,3  -  5/3354^^1,2,3- 
Now  we  eliminate  the  variable  ^4  by  (14).  The  result  is  the  equation 

0  =  go(52)  +  ^1(52)53  +  ^2(^2)  0i->  (20) 


where 

go(52)  :=  (15/2)di,2,35|  -  (l5/2)di,2,55|  +  [(15/2)di.3.5  -  (l/2)di.2,6]52 

+  (15^2,3,5  -  (5/2)di,3,6)52  +  ^2,3,6? 

^1(52)  :=  (l/3)dl,2,6  +  5di,3,5  +  5di,2,552  —  25^1,2,352? 

92(52)  :=  — 25ci!i, 2,352 • 

Analogously  we  reduce  equation  (19)  to  an  equation  in  52,53  by  eliminating 
55  and  then  54-  Using  (15),  we  obtain 

0  =  (5/3|  +  15/32/?3)di,2,3  +  (15^1  +  5/J3)d5,i,2 

+  3/32(<i6,l,2  +  2*^5, 1,3)  H - +  ■^/32/34'^1,2,3> 

and  then  by  (14) 

0  =  di,2.3(-i5/3^ + zmul  +  mi) + rf5, 1,2(15^1  +  10/J3) 

+  /52(15ci5,l,3  +  5<i6^i,2)  +  ^6,1, 3' 

In  order  to  get  rid  of  the  term  with  51  in  (21),  we  make  a  final  substitution 

p3~0z  +  a0i,  a  :=  (3/2)  +  vl5/2. 

Then 

-ml,  -  SOMi  +  10/3|  =  [-30^1  +  20a/3|]/33  +  10A^ 
and  (21)  simplifies  to 


0  =  Po(A)  +  Vl(0i)h  +  P2{P3)02> 


(22) 


380 


K.  Scherer 


where  _  _ 

Poif^s)  ■=  10c^1,2,3/53  +  10<il,2,5^3  +  <^1,3,6, 

PliPs)  •=  5^1, 2, 6  +  15cii,3,5,  (23) 

P2{Pz)  •=  10'\/l5(ii,2,3/^3  +  (15  +  10Q:)di,2,5- 

With  the  new  variable  /?3,  (20)  transforms  into 

0  =  go(/^2)  +  9i(^2)^3  +  Q2(P2)p3  »  (24) 

qo(p2)  '=  Qo{p2)  ~  25a^(ii,2,3^2  +  ocplqi{p2), 

Ql{p2)  ‘=  9l(/^2)  -  50di,2,3Q!/^2  5 
^2(^2)  ■=  h{p2)  =  -25(ii,2,3^2, 
and  the  qi{P2)  defined  as  above.  We  summarize  all  this  in 

Lemma  2.  Under  the  assumption  di,2,3  7^  0,  i.e.  assumption  (11),  and  the 
normalization  0(0)  =  0,pi  :=  0'(O)  =  1,  the  vector  P2,...,P7  is  a  solution  of 
the  equations  (22),  (24)  iff  P2^p3  Is  a  solution  of  the  2x2  system  (22),  (24). 

Remark:  The  systems  (8)-(9)  in  the  planar  case,  and  (22)-(24)  in  the  space 
case  possess  a  similar  structure.  In  (8)  and  (22)  the  coefficients  of  P2  are 
polynomials  of  the  same  degree  in  and  P21  respectively.  The  same  is  true  for 
(9)  and  (24),  except  that  the  corresponding  polymials  have  different  degrees. 


§3.  Existence  Theorems 

In  view  of  the  last  remark,  we  treat  in  detail  only  the  planar  case. 

Theorem  1.  The  system  (8)-(9)  has  at  least  one  and  at  most  5  (real)  solution 

pairs  p2,Ps  outside  the  line  ps  =  ^4, 1/2^2, i- 

Proof:  Let  us  write  for  shortness  x  :=  P2  and  y  :=  P3  as  well  as 


My)-=Po{y)^  2jB  :=  Pi (?/)  =  60^4,2,  (^(j/)  :=  ^2(2/)  =  15^4,1  -  30^2, il/- 


Then  (8)  reads  0  =  A{y)  +  2Bx  +  C(y)x^.  Formal  solution  for  x  gives 


X  =  (p±(p)  := 


-B±^B^^A{y)C{y)  _  -B  ±  y/^ 
C(y)  ■  C{y) 


(25) 


with  the  cubic  polynomial 

R{y)  =  15[20d|i?/^  +  30d4,i<^2.iy^  +  (2^6, 1^4,1  -  20dl_i)p  +  240d^,2  "  dAydey]. 

Then  write  (9)  as  0  =  ELo  +  ^o2/  +  hxy  +  b2xy'^,  and  insert  p5),  since 
by  assumption  C{y)  ^  0.  After  multiplication  with  C{y)^,  we  obtain 


0  =  y^ai{—B  ±  y/R{y)YC{y)^  ^■fboyC{y)^ 

i=0 


+  bi{—B  ±  y R{y))C{y)^  +  b2y^{—B  ±  ■\/R{y))C{y)^ . 


(26) 
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(-B  ±  =  B^  +  R{y)  ±  2Bv^, 

(-B  ±  =  -B^  ±  -  3BR{y)  ±  B(j/)v^, 


and  sort  all  terms  with  and  without  \/R{y),  respectively.  Then  (26)  can  be 
written  as 

U(y)  =  ±V(y)V^,  (27) 

where  U{y)  is  a  polynomial  with  leading  term  “11  *  15  •  •  9000^4^2  ^^2,1 
V{y)  also  of  degree  4  with  leading  term  9000^2,1 

Hence  under  the  above  assumption,  /?2,/?3  is  a  solution  of  the  system 
(8)-(9)  iff  2/  =  /^s  is  a  solution  of  (27)  with  sign  either  +  or  -  on  the  right 
hand  side.  Suppose  that  ni  and  n2  are  the  numbers  of  solutions  of  these  two 
equations  (including  multiplicities).  Then  the  squared  equation 

U\y)  =  V^{y)-R{y) 

is  of  degree  11,  and  has  2(ni  +  712)  solutions.  Hence  we  conclude  ni  +  712  <  5. 
To  prove  existence,  write  (9)  as 

H-(/?2,/?3)  =  0,  (28) 

where  H  is  of  degree  3  in  (32  and  with  —  10^2/?3d2,i  as  leading  term  in  (3z-  Then 
introduce  y*  as  the  largest  zero  of  R(y)^  so  that  in  view  of  R{-\-oo)  =  +00 

R(y*)  =  0,  R{y)  >  0  fory*  <  y  <  00. 

Now  insert  both  functions  <p±{y)  in  (28),  and  obtain  the  functions 

H+(y)  :=  H{My),y). 

In  order  to  guarantee  existence  of  a  solution  of  (8)-(9),  it  suffices  therefore  to 
show  that  the  ranges  of  jH+  satisfy 


//■+[2/*,oo)  UH_[2/*,oo)  =  K. 
For  this,  observe  at  first  the  properties 


(29) 


y,+{y*)  =  -B/C(y*)  =  2d4,2/(<i4.i  -  2y*  d^.i)  :=  7’* 


and 

(p+{y)  ^  ±|||^/2sign  {d2,i),  y  ^  oo. 
Then  we  distinguish  the  cases  (assume  without  loss  ^2,1  >  0): 

i)  y*  >y  :=  ^4, 1/2^2, 1,  ii)  y*  <  y. 
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In  case  i),  we  consider  the  ranges  of  (p+  for  {y*,oo),  and  have 

(p+[y*,oo)  =  (-00, (/?*],  (^_[?/*,oo)  =  [<p%oo).  (30) 

Since  both  </?+  are  well  defined  and  continuous  on  (?/*,  oo),  so  are  the  functions 
H^{y)^  and  furthermore 

H^{y*)  =  H{y^\y*). 

In  combination  with 

H+{y)  ^10\d2,i\\yf<p+{y),  y oo,  (31) 

it  follows  that 

H+[y*,oo)  =  (-oo,//'(y?*,  ?/*)],  H-[y\oo)  =  [H{ip\y*),oo), 

and  hence  the  desired  assertion  (29)  in  case  i). 

In  case  ii),  the  function  </?-(?/)  has  a  singularity  at  y  which  lies  in  {y* ,  oo). 
However,  we  can  restrict  its  domain  to  [y*,y)  and  still  have  (dg,!  >  0) 

^-[y\y)  =  (32) 

On  the  other  hand  the  function  (y)  remains  continuous  on  the  whole  inter¬ 
val  (y* ,  oo)  since 

My)  =  My)/{B  +  -  A(y)C{y)). 


Hence  we  have 

<^+[y*,oo)  =  (-oo,c,c*], 

so  that  together  with  (32)  we  have  the  same  situation  as  in  (30)  and  can 
proceed  further  exactly  as  before  in  order  to  prove  (29).  □ 

It  remains  to  discuss  whether  there  exist  solutions  of  (8)-(9)  if  = 
d4,i/2d2,i  :=  y-  In  this  case,  (8)  gives  /32  =  A{y)/B  if  B  /  0,  and  (9) 
can  have  a  solution  only  under  some  additional  constraint  on  the  parameters 
d2,i,d4, 1,^4, 2,^6, 1,^6, 2-  We  omit  it  here,  as  well  as  the  one  which  results  from 
(8)  if  in  addition  B  =  60^4^2  =  0. 

Further,  we  remark  that  there  can  indeed  exist  5  solutions  of  (8)-(9).  To 
this  end,  one  can  consider  the  case  ^4,1  =  0,  where  these  equations  simplify 
in  such  a  way  that  solving  (8)  for  z  ;=  /?3  —  (3/2)/52  gives 


2^4, 2/^2  ^ 
^2,1 


d6,l/^2  ^ 

30^2,1 


+  O{0^% 


Inserting  this  into  (9)  with  sign  +,  it  follows  that 

0  =  -90<i2,i/3|  +  255(^4,2/32  +  ^^6,102  +  *,2  +  0(1//32). 


Here  the  polynomial  of  degree  5  dominates  for  large  P2,  and  it  is  clear  that  the 
4  parameters  d2,i,d4^25d6, 25^6,1  can  be  chosen  such  that  there  exist  5  zeros 
outside  some  bounded  interval  containing  0.  The  situation  for  the  system 
(22)-(24)  in  the  space  case  is  similar. 
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Theorem  2.  The  system  (22)-(24)  has  at  least  one  and  at  most  7  (real) 
solution  pairs  /?2,  A  if  03  ^  “(6  +  \^)<ii,2,5/5A/l5c?i,2,3- 


Proof:  Concerning  existence,  the  argument  is  the  same  as  in  Theorem  1.  We 
define  x  :=  {32, y  :=  03  and  A{y)  :=  po{y),B  :=  pi{y),C{y)  :=  p2{y).  Then 
R{y)  in  (25)  has  the  form 


R{y)  =  50vT5rf?^2.3y^  +  50'/i5di,2  ,52/^  +  linear  term. 

Again  define  y  :=  -(6  +  \/l5)di,2,5/5V^di,2,3  as  the  zero  of  C{y)  and  y*  as 
the  largest  zero  of  R{y),  and  let  ip±{y)  :=  {B  ±  y^R{y))/C{y).  Its  asymptotic 
behaviour  is  described  by 


^  sgn 


00. 


(33) 


Now  we  distinguish  as  in  Theorem  1  the  cases  i)  and  ii)  and  conclude  that 
either  (30)  holds  or  (31),  respectively. 

Next  write  (24)  similarly  as  in  (28)  as 


H(P2,03)  =  O 

and  define  H±{y)  :=  H{ip±{y),y).  Its  asymptotic  behaviour  is  somewhat 
more  complicated  to  determine  than  in  (31),  since  by  the  definition  of  the 
qi{y),  i  =  0, 1,2,  in  (24),  the  leading  term  of  H{{32,03)  now  has  the  form 


(■y  ~  25Q;^)di^2,3^2  +  ^^Oi^i,2,302  +  (25di^2,3  —  50adi^2,3)/^3^2  ~  25^1,2,3/^2/53^- 


Inserting  (33)  with  ^3  =y,  one  derives  from  this 

^±(2/)  ~  const.  y  00. 

Hence  we  obtain  the  same  property  (29)  for  H±  as  for  H±  in  Theorem  1,  and 
existence  of  a  (real)  solution  pair  of  (22),  (24)  follows. 


In  order  to  show  the  bound  on  the  number  of  solutions,  we  proceed  again 
as  in  Theorem  1.  We  solve  (22)  for  x  =  {32  as  in  (25)  and  insert  it  into  (24) 
written  as 

5  3 


0  =  ^  aix^  +2/^1 


cxy 


i=0 


i=0 


with  constants  ai,bi  and  c.  This  gives  after  multiplication  with  C{y)^ 


0  =  '£ai(-B±^/W^yC{yf-' 

i=0 

3 

+yY,bi{-B±  vmyc{yy-‘ + ±  yw))c(vY. 


2=0 


Then  sort  by  terms  with  and  without  dl^JR[y)  .  The  result  is  an  equation 
of  type  (27),  this  time  with  polynomials  U{y)  of  degree  7  and  V{y)  of  degree 
6.  Therefore  we  conclude  by  the  same  argument  as  in  Theorem  1  that  the 
system  (22),  (24)  has  at  most  7  solutions  .  □ 

The  discussion  of  the  degenerate  case  ^^3  ^  -(6  +  \/l5)<ii,2,5/5\/l5di,2,3 
is  omitted.  It  can  be  done  similarly  as  for  Theorem  1. 
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§4.  Final  remarks 

We  have  shown  that  local  approximation  order  8  can  be  achieved  with  para¬ 
metric  polynomial  curves  of  degree  4  in  the  planar  case,  and  degree  5  in  the 
space  case.  The  method  for  this  consists  in  determining  a  suitable  reparame- 
trization  0,  and  then  the  Taylor-polynomial  with  respect  to  f^ocp.  For  practical 
purposes,  however,  it  is  important  to  consider  also  Lagrange  or  Hermite  inter¬ 
polation  in  the  sense  of  the  equations  (1).  This  has  been  done  in  case  k  ~  2 
for  d  =  2  in  [1,3,6]  and  for  d  =  3,  in  [5],  but  results  for  higher  k  do  not  seem  to 
be  available  so  far.  In  this  respect  another  interesting  open  question  is  which 
order  of  geometric  continuity  can  be  preserved  when  a  piecewise  polynomial 
curve  is  constructed  by  pieces  of  such  local  aproximations. 
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A  B-Spline  Approach  to  Hermite  Subdivision 


U.  Schwanecke  and  B.  Jiittler 


Abstract.  We  present  a  new  approach  to  Hermite  subdivision  schemes. 
It  is  based  on  the  observation  that  a  sequence  of  second  order  Hermite 
data  define  a  unique  interpolating  cubic  spline.  The  B-Spline  form  of 
this  interpolating  spline  leads  to  a  stationary  binary  subdivision  scheme 
with  4  different  subdivision  rules  for  the  control  points.  We  construct  a 
generalized  4-point  scheme  which  leads  to  a  new  family  of  Hermite 
subdivision  schemes. 


§1.  Introduction 

Starting  from  an  initial  sequence  of  second  order  Hermite  elements 

(i.e.  vectors  containing  function  values  and  associated  first  derivatives),  a  Her¬ 
mite  subdivision  scheme  (cf.  [4, 5, 6, 7])  of  order  two  recursively  generates  finer 
sequences  of  Hermite  elements  associated  with  the  dyadic  points 

The  refinement  is  based  on  two  rules, 

m  m 

j=0  j=0 

where  the  matrix  masks  . . . ,  Am^},  =  {Bq^\  . . . ,  of 

the  scheme  consist  of  real  2x2  matrices  depending  on  the  subdivi¬ 

sion  level  k.  Merrien  [7]  considered  Hermite-type  2-point-schemes  (i.e.  with 
m  =  1),  generating  functions.  By  introducing  an  auxiliary  point  subdivi¬ 
sion  scheme,  Dyn  and  Levin  [4,5]  analyzed  stationary  Hermite-interpolatory 
subdivision  schemes  of  arbitrary  order.  Using  this  approach,  Kuijt  [6]  con¬ 
structed  several  Hermite  interpolatory  subdivision  schemes  of  order  two. 
Kuijt  derived  the  refinement  rules  by  considering  the  polynomials  interpolat¬ 
ing  neighboring  Hermite  elements,  and  sampling  Hermite  data  from  them. 

By  considering  the  interpolating  splines  associated  with  the  Hermite  ele¬ 
ments,  this  paper  introduces  a  new  approach  to  Hermite  subdivision.  We  ana¬ 
lyze  the  smoothness  of  the  limit  function,  and  present  a  family  of  Hermite 
subdivision  schemes  generalizing  the  4-point  scheme  [3].  This  spline-based 
approach  can  be  generalized  to  Hermite  elements  of  arbitrary  order. 
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§2.  Spline  Subdivision  Schemes 

At  each  subdivision  level  k,  the  Hermite  data  define  a  unique  in¬ 

terpolating  cubic  spline,  having  the  B-spline  representation 

With  knots  T<'=)=(.... ...)•  (2) 

*  2X  2X 


The  control  points  G  IR  are  associated  with  the  Greville-abscissas  (see 

e-g’  [8])  -  sif  an'*  ^2i|i  =  +  ak'  to™ing  a  nonuniform 

sequence.  Control  points  and  Hermite  elements  are  related  by  the  transfer- 
mations  {vif  and  ,  where 

^^''=>  =  (1  “#)■  =  32^1)-  (3) 


Clearly,  the  spline  function  can  be  represented  with  respect  to  the  refined 

knot  vector  Knot  insertion  leads  to  the  following  4  refinement  rules 

for  the  B-Spline  control  points: 

Ak+i)  __  3  (k)  1  (fc)  ~ik+i)  _  1  (fc)  5  (fc)  .  2  (fc) 

P4i  —  4^21  +  4^21+1?  PAi+2  ~  8^2i  +  8^21  +  1  ^  8^21+2’ 


^:(fc+l)  _  iJfc)  ,  3„(fc)  AM)  _ 

P4i+1  “  4P2i  +  4^2i-|-l’  P4i+3 


2„W  .Sjfc)  , 

8P2i+l  8^^2i+2  ^  8^2i+3' 


(4) 


The  affine  combinations  (4)  describe  a  4-rule  stationary  binary  subdivi¬ 
sion  scheme  for  the  (nonuniformly  parametrized)  B-spline  control  points.  This 
scheme  generalizes  the  splitting  step  of  a  binary  uniform  subdivision  scheme. 
The  sequence  of  control  polygons  converges  to  the  limit  function 
Generalizing  (4)  leads  to  the  notion  of  a  spline  subdivision  scheme: 

Definition  1.  A  spline  subdivision  scheme  5(a°, a^,  a^, a^)  with  the  coeffi¬ 
cient  masks  a^  =  •  •  •  >®2m+i)»  generating  a  sequence  of  cubic  spline 

functions  is  given  by  the  four  subdivision  rules 

2m+l 

Piit^  =  E  '  **  =  0, 1. 2, 3  ,  fc  =  0. 1, 2, . . . .  (5) 

j=0 

With  the  help  of  the  transformations  (3),  the  matrix  masks  of  Hermite  sub¬ 
division  schemes  (1)  can  be  transformed  into  the  coefficient  masks  of  spline 
subdivision  scheme  (5),  thus  motivating  the  following  definition. 


Definition  2.  A  Hermite  scheme  is  said  to  be  stationary  if  the  matrices  Aj  := 
ff{k+i)  yk)  Bj  ■.=  are  constant  for  all  k  {j  = 

0, . . .  ,m).  The  coefficients  a^  of  the  associated  spline  subdivision  scheme  are 
obtained  from 


— 


^2j 

4j 


^2j+l 

A 

^2j  +  l 


®2j  +  l  ^ 
®2j  +  l  / 


Consequently,  every  stationary  Hermite  subdivision  scheme  is 

equivalent  to  a  spline  subdivision  scheme  *S'(a^,  a^,  a^, a^).  Note  that  a  spline 
subdivision  scheme  can  also  be  seen  as  a  special  matrix  subdivision  scheme 
(see  [1])  acting  on  vectors  of  2  consecutive  control  points. 
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§3.  Convergence  Analysis 

In  the  sequel  we  generalize  the  approach  introduced  by  Dyn,  Gregory  and 
Levin  [2]  to  the  spline  subdivision  case.  Consider  a  spline  subdivision  scheme 
«5'(a^, a^, a^, a^)  on  the  finite  domain  [0,n]  G  IR.  The  scheme  is  well  defined 
on  this  domain,  for  all  fc  >  0,  if  the  associated  Hermite  elements  at  stage  k 
are  defined  on  the  set  €  Zfe}?  where 


Zfc  =  +  m 


2m  -  1  if  Am  /  0 
2m  —  2  if  Am  ~  0. 


The  spline  function  (2)  has  the  knots 


(t)  M)  Ak) 

2*“n+ni  ’  ^2*^n+ni  ’  *'2*=7i+ni+l 


) 


and  the  control  points  (pf  ^)i=o,...,2(2fcn+ni)+i- 

Consider  an  interval  at  the  A;-th  subdivision  step.  The 

control  points  which  govern  the  future  behavior  of  the  process  in  this  interval 
are  gathered  in  the  vector  pi  k  =  (P2i\ •  ■  •  ,P2t>ni+i)+i)^- 

The  control  point  vectors  p2i,fc+i,  P2i+i,fc+i  at  the  subdivision  level  k  +  1, 
associated  with  the  subintervals  obtained  from  pi^fc  by 

two  linear  transformations, 


P2i,fc+1  —  ^0  Pi,k  and  P2i+1,A:+1  =  <^1  Pi,fc 


(6) 


with  Go  =  “  ^(i”'m-2)’  where  G(V;;>)  is  the  matrix 

comprised  of  the  elements  of  G,  at  rows  ii  <■••<  ip  and  columns  Ji  <■■  ■<  jp. 
These  linear  transformations  are  expressed  as  submatrices  of  the  M  x  M 
generator  matrix  G,  where  M  =  2(ni  +  3).  If  Am  /  0,  then  M  =  4(m  +  1), 
and  we  get  the  generator  matrix 


/Ao 

‘  ■  ■  Am 

0 

. 

Bo 

Bm 

0 

. 

0  ’ 

0 

Ao 

. 

0  ••• 

0 

G  = 

0 

Bo 

. 

Bm 

0  ••• 

0 

(V 

0 

. . . 

O  • 

O 

,  .  . 

■  ■  ■  Am 

0 

\  0 

0  Bo 

•  •  *  Bm 

0/ 

Otherwise,  if  Am  =  0,  M  =  4m  +  2  and  the  generator  matrix  is  as  above  but 
with  the  last  two  rows  and  columns  deleted. 


3.1.  Continuity 

The  following  necessary  condition  is  analogous  to  [2,  Prop.  2.3].  Alternatively 
it  can  be  formulated  using  the  eigenstructure  of  the  masks  of  the  associated 
matrix  subdivision  scheme,  cf.  [1]. 
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Proposition  3  (Affine  invariance).  A  necessary  condition  for  the  uniform 
convergence  of  the  spline  subdivision  process  to  a  continuous  nonzero  limit 
function  on  [0,n],  for  arbitrary  initial  data,  is  that  =  1  for  all 

h  =  0,l,2,3. 

In  order  to  analyze  the  convergence  to  a  continuous  limit  function,  we  exam¬ 
ine  the  difference  scheme  AS'(a°,a^a2,a^)  generating  the  differences  = 
If  the  necessary  condition  of  Proposition  3  is  satisfied,  then  this 
process  can  again  be  described  with  the  help  of  another  generator  matrix 

(s) 

which  is  obtained  using  the  upper  triangular  matrices  Em  =  +  <51+1,;) 

and  [EmY^  =  (-Ef=ro'<5i+h,i),  cf,  [2,  Prop.  3.2].  The  M-3  differences 
governing  the  future  behavior  of  the  process  in  the  interval  are  again 
collected  in  a  vector  =  [^2i^  •  •  •  ’  ^2(t-+ni+i)l^'  analogues  of  the 

transformation  (6)  are 

A2i,fc+i  =  and  A2i+i,fc+i  =  (9) 

where  Co  =  C(\Z^Zl)  and  Ci  =  Note  that  the  row  and  column 

ranges  of  the  sub-matrices  Co,  Ci  are  diferent  from  those  in  [2],  as  we  analyze 
a  difference  process  with  4  (rather  than  2)  rules.  We  get  (cf.  [2,  Theorem  3.1]) 

Theorem  4.  Let  the  spline  subdivision  process  satisfy  the  necessary  conver¬ 
gence  condition  of  Proposition  3.  Then  the  following  are  equivalent: 

(i)  The  spline  subdivision  process  converges  uniformly  to  a 

continuous  limit  function  on  [0,n]  for  arbitrary  initial  data. 

(n)  The  difference  process  A5(a‘’, a\ a^,  a^)  is  contracting,  i.e.  it  converges 
uniformly  to  zero  on  [0,n]  for  arbitrary  initial  data. 

(in)  There  exists  an  integer  L  >  0  and  a  real  number  0  <  a  <  1  such  that 
IlC'ii  •••C'ii.lU  <  0!,Vij  e  {0,1}  andj  =  1,...,L. 

In  the  sequel  we  have  to  analyze  other  point  processes  with  four  different 
refinement  rules.  The  continuity  of  the  limit  function  can  then  be  analyzed  in 
an  analogous  way,  where  the  generator  matrix  is  obtained  as  in  (8). 

3.2.  Derivative  process 

In  order  to  investigate  the  differentiability  of  the  limit  function  /,  we  analyze 
the  first  derivative  of  the  cubic  C^  splines  (2).  Clearly,  we  obtain  a  sequence  of 
quadratic  C°  splines  with  knots  T(^\  see  Figure  1.  If  the  necessary  condition 
of  Prop.  3  is  satisfied,  then  the  quadratic  splines  are  generated  by  another 
spline  subdivision  scheme,  again  with  four  different  rules  for  the  control  points. 
This  scheme  will  be  called  the  derivative  scheme  ^5'(a°,  a\  a^,  a^). 
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Fig.  1.  Derivative  scheme  and  inscribed  polygon  process. 

Proposition  5.  If  the  derivative  process  dS{aP,aA,B?,a^)  converges  uni¬ 
formly  to  d  E  C[0,n],  then  the  spline  subdivision  scheme  S(a^,a^,a^,a^) 
converges  uniformly  to  f  E  and  f  =  d. 

Using  similar  techniques  as  in  Section  3.1,  we  define  control  point  vectors  and 
a  generator  matrix  V.  We  omit  the  details,  giving  only  the  main  result: 

Proposition  6.  The  derivative  scheme  55(a°,a^,a^,a^)  has  the  (M  -  1)  x 
(M  —  1)  generator  matrix 


with  E]^  =diag(l,2,l,...,2)EM  and  =  {Em)  ^  diag(l,  i,  1, . . . ,  i). 

The  continuity  of  the  limit  function  generated  by  the  derivative  scheme 
can  now  easily  be  analyzed  as  in  Section  3.1  by  discussing  the  associated 
difference  scheme  Ad5(a®,  a^,  a^,  a^).  This  leads  to  criteria  for  continuity 
of  Hermite  subdivision  schemes. 


3.3.  convergence  analysis  via  inscribed  polygons 


In  order  to  examine  higher  order  continuity,  we  inscribe  a  polygon  into  the 
quadratic  spline  and  analyze  the  resulting  subdivision  scheme,  called  the 
inscribed  polygon  process  P55'(a®,a^,a^,a^).  More  precisely,  at  the  subdi¬ 
vision  level  k,  we  consider  the  piecewise  linear  function  with  the  vertices 
see  Figure  1. 

Proposition  7.  The  inscribed  polygon  process  PdS{a^,a^,a'^,a^)  of  the 
derivative  scheme  has  the  generator  matrix  P  =  Lm-i  T>  {Lm-i)~^  which  is 
obtained  using  the  {M  —  1)  x  (M  —  1)  auxiliary  matrices 
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The  derivative  and  the  inscribed  polygon  processes  are  equivalent: 

Lemma  8.  The  derivative  process  dS(a^,  ,  a^,  a^)  converges  uniformly  to  a 
continuous  limit  function  f  on  [0,  n\  if  and  only  if  the  inscribed  polygon  process 
PdS{aP ,  a.^ ,  ,  a^)  converges  uniformly  to  a  continuous  function  g  E  C[0,n]. 

Moreover,  f  —  g- 

Proof:  This  equivalence  is  due  to  the  convex  hull  property  of  B-splines,  and 
to  approximation  properties  of  interpolating  quadratic  splines.  □ 

Using  the  inscribed  polygon  process,  we  are  now  able  to  discuss  the  con¬ 
vergence  of  the  spline  subdivision  scheme  to  limit  functions  with  higher  or¬ 
der  differentiability,  by  the  same  analysis  as  in  the  non-Hermite  case  (point 
schemes).  We  simply  have  to  analyze  the  divided  difference  processes 
PdS{a^ ,a^ ,a^ ,3?)  (see  [2,  Theorem  4.2])  of  the  inscribed  polygons,  as 
follows. 

Theorem  9.  If  the  l-th  order  divided  difference  scheme  of  the  inscribed  poly¬ 
gon  process  PdS{a^,a^,a‘^,a^)  exists  and  converges  uniformly  to  fi  G  C[0,n], 
then  also  the  divided  difference  processes  PdS{a^ ,a^ ,a^ ,a^)  exist  and 
converge  uniformly  to  f,,  E  (7^“'^[0, n]  for  v  =  0,1,...,/,  and  =  /y. 
Hence,  the  spline  subdivision  scheme  S(a^,a^,a^,a^)  converges  uniformly  to 
g  E  with 

For  instance,  in  order  to  prove  that  the  limit  function  generated  by  the  spline 
subdivision  scheme  is  (7^,  the  difference  process  ADPdS{a^,a^,a^,a^)  of  the 
divided  difference  scheme  has  to  be  shown  to  be  contractive,  analogously  to 
Section  3.1.  The  divided  difference  scheme  DPdS{a^ ,  a^ ,  a^ ,  a^)  has  the  gen¬ 
erator  matrix  D  =  2Em-i  P  ({'"m-D ’  From  this  matrix  we  get  the 

generator  matrix  C*  =  Em-2  D  {{'"m-s)  associated  differ¬ 

ence  scheme  ADPdS{a^,  a^,  a^,  a^).  In  order  to  guarantee  a  (7^  limit  function, 
the  matrix  norms  ||(7j*  •  •  ■  (7f^  ||ooj  V  ij  E  {0, 1}  and  j  =  1, . . . ,  L  have  to  be 
less  than  1  for  some  L,  where  Cq  —  ^*(i  ’mIs)?  ^rid  Cl  = 

§4.  A  Generalized  4-Point  Scheme 

Based  on  a  geometric  construction,  Dyn,  Gregory,  and  Levin  [3]  derived  a 
family  of  interpolating  4-point  schemes.  This  family  can  also  be  obtained 
from  an  optimization-based  approach,  as  follows.  Let  the  subdivision  scheme 
generate  a  sequence  of  piecewise  linear  functions  with  knots  and 
control  points  In  order  to  derive  the  refinement  rules,  we  replace  one 

segment  of  with  two  new  ones  (shown  as  dashed  lines  in  Figure  2,  left), 
subject  to  boundary  conditions.  The  new  vertex  is  placed  by  min¬ 

imizing  the  jumps  of  the  first  derivatives  between  new  and  old  polygons.  In 
fact,  minimizing  the  weighted  linear  combination 
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Fig.  2.  Weights  of  objective  functions  in  the  point  (left)  and  spline  (right)  case. 

of  squared  differences  of  derivatives  produces  exactly  the  refinement  rules  of 
the  interpolating  4-point  scheme. 

This  approach  can  be  generalized  to  the  spline  case.  In  order  to  derive  the 
refinement  rules,  we  replace  one  segment  of  with  two  new  ones  (shown 
with  dashed  control  polygons  in  Figure  2,  right),  subject  to  boundary  con¬ 
ditions.  The  inner  new  control  points  ^4^+2^  placed  by  minimizing 

the  jumps  of  the  second  derivatives  between  new  and  old  splines.  Minimizing 
the  weighted  linear  combination 

+  8a-  [#■^'44+1)-^+ H4+i)  1^ 

of  the  squared  differences  of  the  second  derivatives  gives  the  refinement  masks 

a"  =  (0, 0,  f ,  0, 0, 0, 0),  ai  =.  (0, 0,  i,  0, 0, 0, 0), 
a^  =  (0, 1  -h  —  I  -  2a;,  |  -f  3a;,  |  —  3a;,  -f  2a;,  |  -  a;,0), 

I  “  “i  +  I  ~  4  +  3a;,  -  2a;,  I  +  a;,  0), 

see  (5).  In  order  to  analyze  continuity  of  the  limit  function,  we  compute 

the  generator  matrix  of  the  difference  process  ADF55'(a®,  a^,  a^,  a^),  which 
has  to  be  contractive.  Using  the  techniques  of  Section  3.1,  we  estimate  the 
convergence  range  of  the  parameter  a;,  see  Figure  3. 


-0.01! 


matrix  norms  obtained 
by  considering  the  composition 
of  6  steps,  i.e.  L=  6 


C  convergence  range  (marked  in  grey) 


yi  6.04  0.05  ^ 


Fig.  3.  Estimating  the  convergence  range  of  the  generalized  4  point  scheme. 
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Fig.  4.  Interpolatory  limit  functions  (left)  and  derivatives  (right). 

Two  limit  functions  interpolating  three  given  Hermite  elements  have  been 
drawn  in  Figure  4  (left).  The  functions  have  been  generated  with  the  help 
of  Merrien’s  scheme  (a  =  0.2,  dashed  curve,  cf.  [4,7]),  and  using  the 
generalized  4-point  scheme  (a;  =  0.015,  solid  curve).  As  can  clearly  be  seen 
from  the  associated  first  derivatives  (right),  the  generalized  4-point  scheme 
produces  a  (7^  limit  function. 
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An  Interpolation  Method  with  Weights  and 
Relaxation  Parameters 

Chiew-Lan  Tai,  Brian  A.  Barsky,  and  Kia-Fock  Loe 


Abstract.  This  paper  presents  a  new  interpolation  method  that  is  based 
on  blending  a  nonuniform  rational  B-spline  (NURB)  curve  with  a  singu¬ 
larly  reparametrized  (SR)  linear  spline.  The  resulting  curve  is  called  the 
a-spline.  It  has  weights  and  relaxation  parameters.  Given  the  data  points 
to  be  interpolated,  a  NURB  curve  is  obtained  by  using  these  data  points 
as  its  control  points.  The  SR  linear  spline  is  then  determined  by  imposing 
constraints  on  the  a-spline  to  interpolate  the  data  points.  The  a-spline  is 
parametrically  continuous.  It  involves  only  simple  computation,  and  does 
not  require  solving  linear  systems.  This  approach  is  extended  to  produce 
interpolatory  surfaces,  and  can  be  used  as  a  modeling  tool  for  deforming 
polygonal  shapes  into  smooth  spline  surface  models. 


§1.  Introduction 

Cubic  interpolating  splines  are  known  to  exhibit  undesirable  oscillations  due 
to  “extraneous”  inflection  points.  This  undesirable  property  motivated  the 
introduction  of  tension  applied  to  interpolating  splines.  Barsky  analyzed  two 
well-known  approaches  for  applying  tension  to  interpolating  curves  [1].  The 
first  approach  is  the  exponential-based  spline  in  tension  [12];  the  second  is  the 
more  efficient  polynomial  alternative,  the  i/-spline  [11].  The  tension  parame¬ 
ters  are  associated  with  data  points  in  these  splines. 

Tension  has  also  been  introduced  as  a  shape  parameter  to  non-interpo¬ 
lating  splines.  The  Beta-spline  [2]  incorporates  bias  and  tension  parameters, 
based  on  the  theory  of  geometric  continuity.  Another  spline  technique  that 
has  bias  and  tension  parameters  (and  also  a  “continuity”  parameter)  was 
proposed  by  Kochanek  and  Bartels  [8].  Unlike  those  in  the  Beta-spline,  these 
parameters  control  only  the  first  derivatives,  and  the  resulting  curve  is  and 
interpolatory.  The  weighted  splines  by  Foley  [6]  provide  tension  control  on 
curve  segments  between  interpolation  points,  rather  than  at  the  interpolation 
points. 
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In  addition  to  spline  methods  that  allow  the  user  to  modify  the  shape  pa¬ 
rameters,  a  related  research  area  is  the  automatic  determination  of  shape  pa¬ 
rameters  to  produce  curves  and  surfaces  that  satisfy  certain  shape-preserving 
criteria  [5,7].  Another  more  subjective  criterion  used  is  fairness,  or  “visual 
appeal” . 

NURBS  are  also  often  used  to  solve  interpolation  problems.  However,  us¬ 
ing  the  NURB  as  a  surface  interpolant  requires  performing  0{mn)  operations 
to  solve  the  0{m  -f  n)  linear  systems  for  a  dataset  of  m  x  n  points.  Although 
these  systems  are  tridiagonal,  they  still  incur  substantial  computations.  More¬ 
over,  the  interpolation  method  is  global;  thus,  changes  to  any  data  point  will 
require  solving  again  all  the  linear  systems. 

The  idea  of  using  singular  blending  to  solve  the  interpolation  problem,  with 
tension  control,  was  proposed  by  Loe  and  Tai  [10].  In  this  paper,  we  use  this 
idea  to  propose  another  interpolation  method  that  has  weights  and  relaxation 
parameters.  A  high  relaxation  value  has  the  effect  of  reducing  tautness.  The 
method  is  based  on  blending  a  NURB  curve  with  a  sequence  of  line  segments, 
or  a  linear  spline,  that  is  reparametrized  such  that  each  of  the  line  segments 
has  zero  derivatives  at  both  ends. 

This  work  is  closely  related  to  some  earlier  work  of  Coons  [4].  He  used 
this  blending  technique  to  modify  any  given  piecewise  curve  by  letting  the  line 
segments  be  those  connecting  the  joints  of  the  given  curve,  and  proved  that  if 
the  given  curve  is  then  the  blending  function  only  has  to  be  for  the 

resulting  curve  to  retain  the  continuity  of  the  original  curve.  More  precisely, 
Coons’  aim  was  not  to  solve  the  interpolation  problem:  for  the  new  curve  to 
be  interpolatory,  the  original  curve  has  to  be  interpolatory. 

§2.  Singular  Blending 

Given  a  smooth  piecewise  curve  C{u),  a  tension  control  can  be  introduced  by 
blending  the  smooth  curve  with  a  linear  spline  that  approximates  the  curve 
[4,9].  This  linear  spline  can  simply  be  obtained  by  connecting  the  joints  of  the 
smooth  curve,  but  in  general  it  need  not  interpolate  the  smooth  curve.  Let 
L{u)  denotes  a  linear  spline.  The  blending  then  gives 

Q{u)  =  {1  -  a)C{u)  +  aL{u),  uG[uo,u„],  0  <  a  <  1.  (1) 

It  is  easy  to  see  that  as  a  increases,  the  contribution  of  L{u)  to  Q{u)  increases; 
thus,  the  resulting  curve  is  more  taut,  simulating  a  higher  tension. 

Assuming  that  the  smooth  curve  C{u)  is  (generalization  to  higher 
order  curves  is  straightforward),  then  for  Q{u)  to  retain  this  continuity,  in 
general  L{u)  must  be  at  least  C^.  We  define  L{u)  as 

L{u)  =  (1  -  s{t))V j  +  s{t)V j+i,  u  e  luj,uj+i),  (2) 

where  t  =  j  =  0?  are  the  vertices  of  the  linear  spline,  and 

s(i)  is  a  monotonically  increasing  function  yet  to  be  defined.  Since  L{u)  must 
be  it  must  satisfy  the  following  conditions: 

L'[uj)  =  L\uj^i)  —  0  and  L"{uj)  =  L”{uj^i)  =  0. 
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These  conditions  can  be  satisfied  by  letting  s{t)  be  the  quintic  Hermite  polyno¬ 
mial  — 15£^  4-6t®  [3].  We  call  a  linear  spline  that  satisfies  these  conditions 
a  singularly  reparametrized  (SR)  linear  spline. 

For  the  technique  to  be  useful,  it  should  have  local  tension  control;  that 
is,  the  a  in  (1)  must  be  a  function  of  u, 

Q{u)  =  (1  -  a(^i))C'(w)  I-a{u)L{u),  u  G  [uQ,Un]-  (3) 

The  Q:(7i)  function  must  satisfy  three  criteria:  (1)  interpolate  the  local  parame¬ 
ters  aj,  j  =  0, ...,  n;  (2)  employ  a  local  interpolation  method  so  that  modifying 
a  particular  aj  will  affect  only  the  neighboring  curve  segments;  (3)  be  at  least 
so  that  the  blended  curve  is  also  (Coons  [4]  showed  that  if  the  SR  line 
segments  are  restricted  to  those  connecting  the  joints  of  the  given  curve,  then 
a(u)  only  need  to  be  C^).  We  observe  that  s{t)  can  be  used  to  define  q;(u), 
with  all  three  criteria  satisfied;  hence,  we  define 

q;(u)  =  (1  -  s{t))a*  s(t)Q;J+i,  u  e  [uj,Uj+i),  (4) 

where  t  =  We  then  have  a'{uj)=a'(uj^i)  =  a”{uj)=a”{uj+-i)  =  0. 

A  drawback  of  this  definition  of  a{u)  is  that  drastic  changes  in  adjacent  a] 
values  can  cause  the  curve  to  undulate.  An  alternate  method  is  to  estimate 
the  first  and  second  derivatives  at  the  joints  by  some  approximation  method, 
and  obtain  Q:(n)  by  Hermite  interpolation. 

This  idea  of  singular  blending  can  be  applied  to  many  combinations  of 
curves  and  SR  linear  splines.  Since  NURBS  are  prevalent  in  industry,  we 
let  the  smooth  curve  C{u)  be  a  NURB  curve  and  call  the  resulting  curve 
the  o-spline.  If  the  SR  linear  spline  is  the  control  polygon  of  the  NURBS 
curve,  then  the  resulting  a-spline  is  non-interpolatory.  The  non-interpolatory 
a-spline  includes  the  NURB  as  a  special  case;  when  all  tension  parameters 
are  zero,  the  a-spline  reduces  to  a  NURB.  Some  other  geometric  properties 
inherited  from  NURB  include  convex  hull,  affine  and  projective  invariance, 
and  local  control. 


§3.  The  Interpolating  a-spline  Curve 

The  a-spline  is  non-interpolatory  when  the  SR  linear  spline  is  the  control 
polygon  of  the  NURB  curve.  However,  there  is  no  reason  to  restrict  the  SR 
linear  spline  to  be  the  control  polygon.  In  this  paper,  given  a  NURB  curve, 
we  will  determine  a  new  SR  linear  spline  such  that  when  they  are  blended, 
the  resulting  a-spline  interpolates  a  given  set  of  data  points. 

Let  the  data  points  be  Pj,j  =  0,  ...,n.  The  cubic  NURB  curve  must 
somehow  approximate  the  data  points.  A  simple  way  to  achieve  this  is  by 
letting  the  given  data  points  serve  as  the  control  points.  Next,  we  must 
introduce  two  new  control  points  so  that  the  number  of  NURB  curve  segments 
is  n,  where  each  curve  segment  corresponds  to  an  interval  between  two  data 
points.  Since  we  want  the  NURB  curve  to  interpolate  the  endpoints  Pq  and 
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Pn,  we  let  the  new  points  be  coincident  with  the  two  endpoints,  and  set  the 
first  four  and  last  four  end  knot  values  to  be  equal;  that  is, 

P_i  =  Po  and  P„+i  =  P„, 

and  U-s=u-2=u-i  =  uo,  and  Un=Un+i=Un^2  =  Un+3  in  C/’={k_3,  ...,Un+3}- 
The  interior  knot  values  uo,...,Un  are  computed  using  methods  such  as  the 
chord-length  or  centripetal  method.  All  the  examples  in  this  paper  use  the 
centripetal  parametrization.  The  cubic  NURB  curve  sequence  is  given  by 


n+l 

C'(u)  =  ^  u6[wo,u„], 

j  =  -l 


where  Rj^4{u)  are  the  cubic  rational  B-spline  basis  functions  defined  over  the 
knot  vector  U. 

To  determine  the  unknown  vertices  Vj,  j  =  0,  ...,n,  of  the  SR  linear 
spline,  we  impose  constraints  on  the  resulting  a-spline  Q{u)  to  interpolate  all 
the  data  points:  Pj  =  Q(uj)^j  =  0,  Substituting  Q{uj)  from  (3), 


Pj  =  (1  -  a{v.j))C{uj)  +  a{uj)L{uj)  =  (1  -  a])C{uj)  +  a]V j. 

*  P 

Solving  for  V then  adding  Pj  and  subtracting  ^  yields  equations  that 
require  only  simple  computations: 


vj  =  ^  p.  +  L^(p.  _  c{uj)). 


Defining  pj  = 

i 


yields 


Vj  —  Pj  +  Pj  {Pj  C{uj))j  j  —  0,...,n.  (5) 


We  call  pj  the  relaxation  parameters.  The  geometric  interpretation  of  (5) 
is  that  C'(tij),  Pj,  and  V j  are  collinear,  and  the  distance  between  C{uj)  and 
Pj,  and  between  Pj  and  Vj  are  in  the  ratio  1  :  Pj.  We  know  that  pj  >  0, 
because  aj  <  1.  From  empirical  study,  we  have  found  0  <  p j  <  4  to  be  a 
useful  range.  The  midpoint  value  pj  —  2  yields  visually  appealing  shapes  for 
most  datasets;  thus,  we  use  that  value  as  the  default  relaxation  value.  The 
effect  of  Pj  can  be  interpreted  from  noting  that  a*  =  and  observing 

the  role  of  a  in  (3).  The  value  pj  =  0  corresponds  to  the  maximum  tension 
Qj  =  1;  when  all  pj  —  0,  only  the  SR  linear  spline  contributes  to  the  a-spline 
and  we  have  a  linear  interpolant.  When  the  relaxation  value  pj  increases, 
a*j  decreases,  and  the  contribution  of  the  SR  linear  spline  to  the  a-spline 
decreases;  thus,  the  resulting  curve  is  being  relaxed  locally.  By  using  different 
relaxation  values,  we  can  easily  obtain  rounder  or  sharper  corners  without 
specifying  multiple  knots  or  multiple  control  points.  Fig.  1  shows  the  effect  of 
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Fig.  1.  Effect  of  global  relaxation:  (left)  1,  (right)  2. 


Fig.  3.  Effect  of  varying  w^:  2,  0.5;  the  other  wj  are  unity.  All  pj  are  2. 

applying  the  relaxation  globally.  It  can  be  observed  that  while  the  SR  linear 
spline  found  is  dependent  on  the  global  relaxation  value  specified,  the  cubic 
NURB  curve  remains  fixed.  Fig.  2  shows  the  effect  of  varying  the  relaxation 
locally  (while  the  other  part  of  the  curve  has  the  default  relaxation  value). 
Note  that  the  effect  of  each  pj  is  very  localized;  only  the  nearest  two  curve 
segments  are  affected. 

The  effect  of  the  weight  is  less  obvious.  Each  Wj  affects  four  neighboring 
segments,  i.e.,  [uj-2iUj+2]-  When  Wj  decreases,  C{uj)  moves  further  from  Pj 
(assuming  pj  is  fixed);  thus,  from  (5),  V j  found  is  further  from  Pj.  Since 
the  weights  are  relative  in  nature,  decreasing  Wj  also  causes  <7(uj_i)  and 
C{uj+i)  to  move  closer  to  the  edge  Pj-2Pj-i  and  Pj+iPj+2?  respectively, 
and  to  be  on  that  edge  when  Wj=0.  Hence,  when  Wj  =  0,  from  (5)  again, 
Pj_2,  C{uj-i),  Pj-i,  and  Vj-i  are  all  collinear,  and  so  are  Pj+2f  C(uj+i), 
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Pj+i,  and  Vj+i.  This  effect  on  the  SR  linear  spline  is  depicted  in  Fig. 3; 
only  W5  is  varied  here.  To  summarize,  decreasing  wj  causes  the  interpolating 
a-spline  to  be  rounder  near  Pj,  but  causes  its  segments  between  Pj_2Pj_i 
and  Pj^iPj^2  to  be  more  taut  near  the  points  Pj_i  and  Pj+i. 

The  a-spline  Q{u)  is  clearly  continuous  since  a{u),  L{u),  and  C{u) 
are  all  C^.  The  first  and  second  derivatives  at  the  knots  are  as  follows: 

Q’iuj)  =  (1  -  a*)C'K)  +  a'{uj){L{uj)  -  C{uj)), 

Q"(uj)  =  (1  -  a*)C"(uj)  -  2a'(uj-)C'(uj)  +  a"(uj)(L(u,)  - 

If  a(u)  is  defined  as  in  (4),  then  Q'(uj)  =  (1  -  aj)C'(uj)  and  Q"(uj)  = 
(1  —  aj)C"(uj).  That  is,  the  derivatives  at  the  joints  of  the  blended  curve 
Q(u)  are  in  the  same  directions  as  their  counterparts  of  C(u). 

§4.  The  Interpolating  a-spline  Surface 

Analogous  to  the  blending  of  a  NURB  curve  with  an  SR  linear  spline,  we 
can  blend  a  NURB  surface  with  a  network  of  singularly  reparametrized  (SR) 
bilinear  patches.  An  SR  bilinear  patch  is  defined  as  follows: 


L(U,V)  =(1  -  5(7’))(1  -  s(t})Vij  +  (1  -  s(r))5(t)l^ij  +  i 

-H  s(r)(l  -  s(t))Vi^ij  +  s(7-)s(t)Fi+ij+i,  (6) 

whereu  €  e  ,  t  =  ,  and  the  functions 

s{*)  are  the  Hermite  polynomials  given  earlier.  That  is,  it  is  parametrized  such 
that  its  first  and  second  order  partial  derivatives  go  to  zero  at  the  boundaries: 


duL{ui,v)  =  duL{ui^i,v)  =  dyL{u,Vj)  =  dyL{u,Vjj^i)  =  0, 
dlL{ui,v)  =  dlL{uij^i,v)  =  dlL{u,Vj)  =  dlL{u,Vj+i)  =  0, 
dl^L(ui,v)  =  dl^L{ui^i,v)  =  dl^L{u,Vj)  =  dl^L{u,Vj^i)  =  0. 

The  a-spline  surface  is  then  given  by 

Q[u,v)  =  [I  -  a{u,v))S(u,v)  Ol{u,v)L{u,v),  U  G  [uo,'Um]; 

where  a{u,v)  is  the  blending  function  that  interpolates  the  local  tension  pa¬ 
rameters  a*  j,  defined  by  an  equation  similar  to  (6). 

To  find  an  a-spline  surface  interpolating  a  given  network  of  data  points 
Pijii  =  0,  ...,m,  j  =  0,  ...,n,  we  must  first  find  a  NURB  surface,  then  deter¬ 
mine  a  network  of  SR  bilinear  patches  to  be  blended  with  the  NURB  surface. 
The  NURB  surface  is  defined  by  simply  letting  the  data  points  be  the  control 
points.  As  in  the  case  of  curves,  we  repeat  the  boundary  vertices  and  let 
the  first  and  last  four  knot  values  be  equal.  Repeating  the  boundary  vertices 
along  the  j-index,  then  repeating  those  along  the  z-index,  we  obtain 

P —i,j  ~  -F o,j  and  P m+ij'  “  P m,j}  j  ~  O?  ...,ti, 

P t,— 1  “  -F i,o  and  P i,n+i  ~  Pi, ^  1. 
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Setting  the  end  knot  values  to  be  equal  in  the  knot  vectors  U={u-3, Um+s} 
and  V  = 

'1£_3  =  U—2  ~  1  —  WQj  Um  —  '^m+1  ~  ‘^m+2  ~  '^m+3> 

^^-3  =  V_2  =  -y-l  =  Vo,  Vn=  V„+i  =  V„+2  =  V„+3. 

The  interior  knot  values  can  be  determined  using  any  good  parametrization 
method.  The  NURB  surface  is  then  defined  by 

m+l  n+1 

S{u,v)  u  G  [vo,v^l;  V  G  [vo,Vn], 

i=-li=-l 

where  i2i,4(v)  and  Rj^4(v)  are  cubic  rational  B-spline  basis  functions  defined 
over  U  and  V. 

To  determine  the  vertices  Vij’s  that  define  the  SR  bilinear  patches,  we 
impose  constraints  on  the  Q:-spline  surface  to  interpolate  the  data  points: 

=  (1  -  alj)S(ui,Vj}  +  i  =  0,  j  ~  0, 

Solving  for  Vij,  we  obtain 

Vij  =  Pij  H  —  —  S{ui,Vj)). 

Defining  pij  =  yields 

Vij  =  Pij pi,j{Pi,j  •S' (tij,  Vj)),  i  0,  ...,Tn,  j  0,...,n. 

This  enables  the  direct  evaluation  of  Vij,  and  avoids  the  necessity  of  solving 
linear  systems  required  by  interpolation  with  NURB  surface. 

§5.  Smoothing  Polygonal  Shapes 

In  addition  to  fitting  a  smooth  surface  over  a  dataset,  the  proposed  interpo¬ 
lation  method  can  also  be  viewed  as  an  interactive  modeling  tool  for  deform¬ 
ing  polygonal  shapes  (with  an  underlying  rectangular  topology)  into  smooth 
objects.  The  vertices  of  the  polygonal  shape  are  the  data  points  to  be  in¬ 
terpolated  by  a  smooth  surface.  With  this  modeling  tool,  the  user  only  has 
to  specify  the  polygonal  vertices,  which  are  fewer  than  the  number  of  control 
points  of  most  spline  schemes.  Manipulating  polygonal  shapes  is  also  simple 
and  easy  for  novice  designers. 

Figures  4  and  5  show  some  modeling  examples,  all  of  which  are  obtained 
from  the  same  input  polygonal  shape,  shown  in  the  top  left  corner  of  Fig.  4. 
In  Fig.  4,  the  relaxation  parameters  are  varied:  globally  in  the  top  row,  and 
locally  in  the  bottom  row  (the  third  row  of  vertices  have  their  relaxation 
parameters  varied  while  the  other  vertices  have  fixed  pi^j  =  1).  In  Fig.  5,  the 
weights  of  the  third  row  of  vertices  are  varied,  while  the  other  weights  are 
fixed  at  unity,  and  the  global  relaxation  is  set  at  pij  =  1  for  ail  i,j. 


Fig.  4.  Effect  of  global  (top)  and  local  (bottom)  relaxation:  0,  0.25,  0.67,  1.5,  2.3. 


Fig.  5.  Effect  of  local  weight  (third-row  vertices):  2.5,  1,  0.5,  0.25,  0. 

§6.  Homogeneous  Representation  of  a-spline 

It  is  well  known  that  the  rational  B-spline  can  be  viewed  as  the  projection 
of  a  polynomial  B-spline  in  homogeneous  space.  This  property  is  important 
because  it  implies  that  all  the  algorithms  for  the  polynomial  B-spline  can  also 
be  applied  to  the  rational  B-spline.  The  rational  B-spline  is  given  by 


C(u)  =  Y,RiA^)Pi, 


(7) 


Nj^i{u)wj 


where  Pj  =  {xj,yj,Zj)  are  the  control  points  and  Rj^4{u)  =  ^ 

'^j  ^  0,  are  the  weights,  and  Nj^4{u)  denotes  the  cubic  B-spline  basis  func¬ 
tions.  The  polynomial  B-spline  curve  in  the  homogeneous  space,  C^{u),  whose 
projection  yields  the  rational  B-spline  C{u)  is  given  by 


c\u)  =  J2nsa{u)p';, 


(8) 


where  Pj  {^WjXj ,Wjyj,'WjZj^Wj),  since  C{u)  —  ( j  («) I'h® 

X,  y,  z  and  w  subscripts  denote  the  respective  components  of  C^{u). 
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We  can  show  that  this  property  is  also  true  for  the  a-spline.  Substituting 
C(u)  from  (7)  into  the  alpha-spline  equation  in  (3),  and  rewriting  it  as  a 
rational  function,  yields 

_  (1  -  tt(ii))  X)  Nj,i{'u)wjPi  +  a{u)C!i,{u)L{u) 

’  ~  Caiu) 

The  denominator  can  be  expressed  as  (1  —  Q!(u))C^(u)  a{u)C^{u)]  hence, 

the  polynomial  form  of  the  a-spline  in  homogeneous  space  is 

Q^(u)  =  (1  -  a{u))C^{u)  +  a{u)L^{u),  u  e  [uq,  Un], 
where  C^{u)  is  given  in  (8),  and  L^{u)  is  given  by 

L''{u)  =  C^{u){(l  -  4t))V^  +  n  e  h,  «i+i), 

where  Vj  =  1)  is  obtained  from  Vj  =  {x'j,y'j,z'j). 

§7.  Conclusion 

We  have  proposed  a  new  interpolation  scheme  based  on  blending  a  non- 
interpolatory  NURB  curve  (surface)  with  an  SR  linear  spline  (SR  bilinear 
patches).  The  resulting  interpolating  o-spline  inherits  the  continuity,  and 
the  affine  and  projective  invariant  properties  of  the  NURB.  The  method  pro¬ 
vides  weight  and  relaxation  control,  involves  only  simple  computations,  and 
supports  the  modeling  paradigm  of  deforming  polygonal  shapes  into  smooth 
spline  surfaces. 

Several  issues  have  yet  to  be  investigated  for  the  a-spline.  One  example 
is  knot  insertion  which  is  useful  for  shape  refinement  and  rendering.  Another 
issue  is  the  automatic  determination  of  the  parameters  to  satisfy  certain  shape¬ 
preserving  conditions. 
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On  Properties  of  Contours  of 
Trilinear  Scalar  Fields 


Holger  Theisel 


Abstract.  We  study  properties  of  contour  surfaces  of  trilinear  scalar 
fields,  and  give  a  classification  based  on  how  many  unconnected  surface 
parts  they  consist  of.  Furthermore,  we  introduce  the  concept  of  the  seg¬ 
ment  number  of  a  voxel.  The  segment  number  is  a  threshold-independent 
measure  which  estimates  how  complicated  the  contours  inside  the  voxel 
are  expected  to  be.  Finally,  we  give  necessary  and  sufficient  conditions  for 
a  voxel  to  have  a  segment  number  of  1.  These  conditions  are  applied  to 
analyze  a  computer  tomography  data  set. 


§1.  Introduction 

Contours  (isosurfaces)  of  trilinear  scalar  fields  are  treated  in  a  variety  of  appli¬ 
cations.  For  instance,  the  data  used  in  volume  visualization  usually  consists 
of  a  number  of  scalars  defined  at  certain  grid  points;  between  the  grid  points 
a  piecewise  trilinear  interpolation  of  the  scalar  field  is  applied. 

Given  a  voxel  V  =  [0, 1]^,  the  trilinear  scalar  field  is  defined  by  setting 
the  values  Cijk{i,j,  k  G  {0, 1})  of  the  field  at  the  corners  of  V .  Then  the  scalar 
field  is  defined  as 


s(zi,  u,  tu)  =  (1  -  -u)  •  (1  -  u) '  (1  -  w)  ■  cooo  +  (1  -  It)  •  (1  -  'y) '  u;  •  cqoi 
-f  (1  -  ii)  •  u  •  (1  -  lu)  •  coio  -\-{l-u)>vw‘  Con 
-h  u  “  (1  -  u)  ■  (1  -  ic)  •  cioo  -t-  u  •  (1  -  u)  •  ic  •  cioi 
-P  w  •  u  •  (1  —  ic)  •  Clio  -{-u-v-w-  Cm- 


Figure  la  illustrates  this.  A  contour  of  V  is  defined  by  s{u,v,w)  =  r  =const 
for  a  certain  threshold  r.  Figure  lb  shows  an  example  of  a  contour  of  (1). 

There  are  a  number  of  algorithms  to  produce  a  triangular  approximation 
of  a  contour  of  (1).  Of  these,  the  Marching  Cubes  (MC)  method  ([3]  and  [4]) 
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Fig.  1.  a)  Voxel  F;  b)  a  contour  in  F;  c)  result  of  MC. 

is  the  most  popular.  Figure  Ic  shows  the  resulting  triangular  approximation 
of  the  contour  shown  in  Figure  lb  using  the  Marching  Cubes  method. 

The  Marching  Cubes  algorithm  distinguishes  several  cases  where  some  of 
them  are  harder  to  treat  than  others.  In  this  paper  we  introduce  a  measure 
of  how  costly  in  terms  of  computing  time  the  MC  algorithm  inside  a  certain 
voxel  is  expected  to  be.  This  characterization  of  a  voxel  -  called  segment 
number  -  is  independent  of  a  particular  threshold.  It  estimates  the  costs  of  a 
Marching  Cubes  algorithm  for  varying  thresholds. 

As  already  stated  in  [2],  the  contour  of  (1)  is  a  rational  cubic  surface.  In 
[2]  this  surface  is  approximated  by  a  collection  of  rational  quadratic  triangular 
patches. 

Section  2  of  this  paper  studies  the  contours  of  (1)  in  the  domain  IR^. 
We  give  a  classification  based  on  how  many  unconnected  surface  parts  the 
contours  consist  of.  Sections  3  and  4  focus  on  contours  of  (1)  inside  a  certain 
voxel.  Section  3  introduces  the  concept  of  segment  number  as  a  measure  of 
how  simply  a  voxel  can  be  treated  by  an  MC  algorithm.  In  Section  4,  necessary 
and  sufiicient  geometric  conditions  for  a  voxel  to  have  a  segment  number  of 
1  are  shown.  In  Section  5,  the  number  of  voxels  with  a  segment  number  of  1 
are  computed  for  a  real  volume  data  set. 

§2.  Classification  of  the  Contour  in 

In  this  section  we  consider  the  contour  of  (1)  not  in  a  particular  voxel  but  in 
the  domain  R^.  In  general,  the  contour  consists  of  a  number  of  surface  parts 
which  are  not  connected  to  each  other.  Before  we  classify  the  contours  of  (1) 
by  the  number  of  unconnected  surface  parts,  we  apply  a  translation  of  the 
coordinate  system  as  shown  in  Figure  2.  Choosing 


^^0  =  -  • 

P 


we  obtain  for  (1) 


P  =  cooi  +  coio  +  cioo  +  Cm  -  cqoo  “  cqh  -  cioi  -  cuo 

/  Cqoo  +  Con  —  cooi  ~  coio  \ 

1  . 

Cqoo  +  cioi  —  cooi  “  cioo 


\cooo  +  Clio  -  coio  -  Cioo  / 


s  =  a‘U  +  b-  v  +  c-  w  +  d'U’V-w-\-e 


(2) 
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Fig.  2.  Translating  the  coordinate  system  of  a  voxel. 

with 

_ (ciii  —  Coil)  •  (cioo  -  cooo)  —  (ciio  —  coio)  •  (cioi  —  cooi) 

CL 

p 

^  _  (cm  —  Cioi)  •  (cqio  -  Cqoo)  -  (cup  -  Ciqq)  •  (cpii  -  Cqoi) 

p 

^  _  (cill  -  Clio)  •  (cpoi  -  Cpoo)  -  (cioi  -  Cloo)  •  (cpii  -  Cpio) 

p 

d  =  p, 

where  e  is  a  certain  constant.  Thus,  we  only  have  to  analyze 

s{u,  v,w)  =  a  •  u-\-  b  ’  V  -j-  c  •  w  d  ’  u  '  V  '  w  =  r  =  const  (3) 

in  A  classification  of  (3)  can  be  achieved  by  rewriting  (3)  asw  = 
and  comparing  the  zeros  of  the  numerator  and  denominator  function.  The 
zeros  of  the  numerator  function  form  a  line  in  the  u  —  v— plane,  whereas  the 
zeros  of  the  denominator  function  give  a  hyperbola.  Studying  their  interplay 
gives  the  following  classification: 

case  1:  abed  <  0,  d  /  0  : 

case  1.1:  (3)  gives  3  unconnected  surface  parts 

case  1.2:  (3)  gives  2  unconnected  surface  parts 

case  2:  abed  <  0:  (3)  consists  of  1  connected  part 
case  3:  abed  =  0,  d  0: 
case  3.1:  r  ^  0  : 

case  3.1.1:  /  0,c  =  0:  (3)  gives  2  unconnected  surface  parts 

case  3.1.2:  a  ^  0,b  =  c  =  0:  (3)  gives  3  unconnected  surface  parts 
case  3.1.3:  a  —  b  =  c  =  0:  (3)  gives  4  unconnected  surface  parts 
case  3.2:  r  =  0  : 

case  3.2.1:  a6  7^  0,  c  =  0:  (3)  gives  3  unconnected  surface  parts 
case  3.2.2:  a  ^  0, 5  =  c  =  0:  (3)  gives  3  parts  intersecting  each  other 
case  3.2.3:  a  =  b  =  c  =  0:  (3)  gives  3  perpendicular  planes. 


Figure  3  illustrates  these  cases. 
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Fig.  3.  Classification  of  the  contours  of  (3)  in  IR^  . 

§3.  Segment  Number  of  a  Voxel 

We  now  study  the  contour  of  (3)  in  a  particular  voxel  V  =  [ito,'Wo  +  1] 
[vo)t^o  +  1]  X  [wq^wo  +  !]•  Unfortunately,  the  results  of  Section  2  are  not 
directly  applicable  here  because  one  connected  surface  part  may  intersect  V 
more  than  once. 

Varying  the  threshold  r  in  (3),  the  contours  change.  So  does  the  number 
of  unconnected  surface  parts  of  the  contour. 

Definition  1.  Given  the  trilinear  scalar  held  s(u^v,w)  =  a-u-j-b-v-j-c-w-i- 
d’U-v-w  in  the  domain  of  the  voxel  V  =  [tio,tio  +  l]  x  [t;oj^o  +  l]  x  [wq,  wq  +  I], 
the  segment  number  iS'(V)  ofV  is  the  maximal  number  of  unconnected  surface 
parts  of  the  contour  s(u,v,w)  =  r  =const  in  V  for  any  threshold  r. 

Figure  4  gives  an  example  of  a  voxel  V  with  *S'(V)  =  1.  Increasing  the 
value  of  r,  the  isosurface  ’’moves”  through  the  voxel.  It  consists  of  at  most 
one  connected  part  for  any  r.  Figure  5  shows  a  voxel  with  5(1^)  =  4.  Here 
the  contours  consist  of  up  to  4  unconnected  parts. 

The  segment  number  is  a  threshold-independent  characterization  of  a 
voxel  V.  For  any  V  we  get  5'(V)  G  {1,2,3, 4}.  For  visualization  purposes, 
voxels  with  a  segment  number  1  are  of  particular  interest.  As  shown  in  the 
example  of  Figure  4,  they  have  a  nice  behavior  while  varying  r.  In  fact,  for 
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Fig.  5.  Contours  of  a  voxel  with  S{V)  =  4. 

any  r  the  contour  consists  of  only  one  connected  surface  part  inside  V.  Thus, 
accelerated  Marching  Cubes  methods  may  apply  to  them.  Moreover,  adjacent 
voxels  with  5'(V')  =  1  may  be  merged  to  form  one  bigger  voxel  before  applying 
Marching  Cubes  methods.  So  it  makes  sense  to  search  for  geometric  conditions 
for  a  voxel  V  to  have  iS'(V')  =  1. 


§4.  Geometric  Conditions  for  5(V)  =  1 

In  this  section  we  give  necessary  and  sufficient  geometric  conditions  for  a 
voxel  to  have  5(1^)  =  1.  Again,  we  consider  the  contour  of  (3)  in  the  voxel 
V  =  [uo,Uo  +  1]  X  [uo,Vo  +  1]  X  [wq^Wq  +  1]. 

To  formulate  the  conditions  for  5'(V’)  =  1,  we  need  to  introduce  the 
concept  of  characteristic  hyperbolas.  The  first  characteristic  hyperbola  hi  in 
is  defined  by  the  condition  Sv.w{u,v,w)  =  0  in  (3).  hi  can  be  written  as  a 
rational  quadratic  Bezier  curve  described  by  two  control  vectors  6j,  bl  and  a 
control  point  6j  (see  [1]).  For  hi  we  obtain 

/(~4bc)/d\  /0\  /  0  \ 

*5=1  0  j,  6j=loj,  &^=ll/6j, 


w\  =  1, 
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Fig.  6.  Location  of  characteristic  hyperbolas;  a),b):  abed  <  0;  c),d):  abed  >  0. 
where  toj  is  the  weight  of  6}.  Then  we  obtain 

- - WWW - 

In  a  similar  way  we  define  the  characteristic  hyperbola  h2  by  Suu;(ti,  v,  u>)  =  0, 
and  hs  by  Suv(u,v,w)  =  0.  The  Bezier  point  bi  with  the  corresponding  weight 
Wi  and  the  control  vectors  60,62  describing  ft 2  are 


0 

60  =  I  (-4ac)/d 

0 


fts  is  described  by 


W  =  1- 


b^o  = 


0 
0 

A-4ab)/d^ 


U  a  b-  c  •  d  <  0  then  fti,  ft2,  fts  intersect  in  two  common  points.  Figures  6a 
and  6b  illustrate  this  situation  from  two  different  viewpoints.  If  a  •  6  •  c  •  d  >  0, 
then  fti,ft2,ft3  do  not  have  any  intersections.  Figures  6c  and  6d  show  this 
from  different  viewpoints.  The  degenerate  case  a  •  6  •  c  •  d  =  0  is  omitted  here. 

To  formulate  conditions  for  5'(V')  =  1,  we  have  to  classify  the  faces  of  V. 
Given  the  voxel  y  =  [uo,uo  +  l]  x  [ucj'^o  +  l]  x  +  let  =  {(u,v,w)  G 

V  ;  tt  =  uo  V  -u  =  Wo  +  1},  /2  =  {{w,w,w;)  eV  :  w  =  uq  V  u  =  wq  +  1},  and 
/a  =  :w  =  WoVw  =  wo  +  l}.  See  Figure  7  for  an  illustration 

of  the  faces. 


Theorem  1.  Let  V  =  [wo,wo  +  1]  x  [wo,wo  +  1]  x  +  1]  be  a  voxel  in 

the  scalar  field  defined  by  (3).  Then  the  condition  .^(V)  =  1  is  equivalent  to 
the  three  conditions  ft^  fi  /^  =  0  and  ft2  H  /2  =  0  and  fts  H  /g  =  0. 

Figure  8  illustrates  the  idea  of  the  proof.  Suppose  intersects  as 
shown  in  Figure  8a.  Figure  8b  is  a  magnification  of  the  voxel  and  fts  in 
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Fig.  7.  The  faces  /i,  /2,  /s  of  a  voxel. 


Fig.  8.  Proof  idea  of  Theorem  1. 

Figure  8a.  We  compute  the  intersection  point  of  hs  and  /s,  and  consider 
the  contour  passing  through  this  point.  As  shown  in  Figure  8a,  this  contour 
consists  of  at  least  two  surface  parts. 

For  the  proof  of  the  converse  statement  of  Theorem  1,  we  assume  that  for 
a  certain  threshold  r  the  contour  consists  of  at  least  two  unconnected  surface 
parts.  Then  we  can  find  a  face  of  V  which  has  two  intersection  curves  with  the 
contour.  (In  the  worst  case  we  have  to  vary  r  to  find  such  a  face).  (Figure  8c 
shows  two  surface  parts  of  the  contour  which  produce  two  intersection  curves 
in  the  upper  face  of  /g).  Then  we  can  find  a  point  on  this  face  which  is  the 
intersection  point  with  the  corresponding  characteristic  hyperbola.  (In  Figure 
8c,  the  marked  point  on  the  upper  part  of  /g  is  the  intersection  with  hs). 


§5.  Results  and  Future  Work 

We  have  tested  the  voxels  of  a  CT  test  data  set  for  the  property  iS'(V’)  =  1. 
The  data  set  consists  of  255  x  255  x  108  —  7,022, 700  voxels.  Figure  9  shows 
a  slice  through  the  data  set. 

In  the  raw  data  we  found  1,978,711  voxels  with  5'(V’)  =  1  (28  %).  After 
some  noise  reducing  filter  operations  on  the  data,  we  detected  4,833,063  voxels 
with  (S'(y)  =  1  (69  %).  This  shows  that  there  is  a  reasonable  number  of  voxels 
with  5(1^)  =  1  to  pay  special  attention  to  them. 

In  the  future  we  plan  to  develop  algorithms  to  merge  voxels  with  5'(V')  =  1 
to  form  bigger  voxels  before  starting  the  Marching  Cubes  algorithm. 
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Fig.  9.  Slice  through  the  test  data  set. 
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Non-Stationary  Subdivision  for  Inhomogeneous 
Order  Differential  Equations 


Joe  Warren  and  Henrik  Weimer 


Abstract.  This  paper  provides  a  methodology  for  the  systematic  deriva¬ 
tion  of  subdivision  schemes  that  model  solutions  to  inhomogeneous  order 
linear  differential  equations.  In  previous  work,  we  showed  that  subdivi¬ 
sion  can  be  used  to  capture  very  efficiently  the  solutions  of  homogeneous 
order,  linear  differential  equations.  The  resulting  subdivision  masks  are 
stationary  and  can  be  precomputed,  allowing  for  very  simple  and  fast  ap¬ 
plication  of  these  schemes.  In  this  paper,  we  show  that  this  method  can 
be  extended  to  express  solutions  of  systems  of  inhomogeneous  order,  lin¬ 
ear  differential  equations.  Even  though  the  resulting  subdivision  masks 
may  be  non-stationary,  the  masks  can  again  be  precomputed.  Thus,  the 
resulting  subdivision  schemes  capture  very  efficiently  solutions  of  inhomo¬ 
geneous  order,  linear  partial  differential  equations. 


§1.  Subdivision  for  the  Modeling  of  Shapes 

Subdivision  is  a  popular  and  efficient  method  for  modeling  shapes.  In  par¬ 
ticular,  subdivision  describes  a  continuous  shape  p  as  the  limit  of  a  sequence 
Pfc,  k  >0  of  discrete  shapes, 


lim  pk  =  p. 

K—*00 

The  beauty  of  subdivision  lies  in  the  fact  that  these  discrete  shapes  pk 
are  linked  by  a  simple  linear  transformation  S  which  is  based  on  splitting  and 
averaging, 

Pk  =  Sk~ipk-i- 

Figure  1  shows  an  example  of  a  subdivision  scheme.  Starting  from  the 
coarse  shape  po  on  the  left,  application  of  the  subdivision  matrix  So  yields  the 
denser  shape  pi .  As  we  continue  the  process,  the  sequence  of  discrete  shapes 
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Po  Pi  P2 


Fig.  1.  Subdivision  models  a  shape  as  the  limit  of  a  sequence  of  discrete  shapes. 

converges  rapidly  to  a  continuous  shape  p  that  follows  the  original  coarsest 
shape  Po  siiid  whose  properties  are  determined  by  the  subdivision  matrix. 

Subdivision’s  popularity  for  the  modeling  of  curves  is  due  to  the  algo¬ 
rithms  by  Chaikin  [9],  and  Lane  and  Riesenfeld  [6].  The  breakthrough  for  the 
modeling  of  surfaces  via  subdivision  was  marked  by  the  papers  by  Catmull 
and  Clark  [2]  and  by  Doo  and  Sabin  [3].  A  popular  subdivision  scheme  for 
modeling  with  triangular  meshes  has  been  proposed  by  Loop  [7],  which  was 
also  used  for  creating  Figure  1. 

§2.  Shape  Modeling  through  Differential  Equations 

Alternatively,  shapes  can  be  characterized  as  solutions  to  partial  differential 
equations.  For  example,  any  polynomial  spline  p[x]  of  degree  m  satisfies  the 
diff’erential  equation  =  0,  requiring  the  (m  H-  l)st  derivative  of  the 

spline  to  be  zero  everywhere  except  at  a  fixed  number  of  knots  [1].  Other  ex¬ 
amples  of  shapes  based  on  partial  differential  equations  are  the  polyharmonic 
surfaces,  including  Thin  Plate  Splines,  as  well  as  many  different  classes  of  fluid 
flows. 

When  modeling  with  differential  equations,  we  determine  a  continuous 
shape  p  that  is  a  solution  to  a  set  of  partial  differential  equations 

Dp  =  b,  (1) 

where  D  denotes  a  continuous  differential  operator  and  6  encodes  the  bound¬ 
ary  conditions  for  the  problem.  For  the  example  of  natural  cubic  splines,  we 
have  D  —  and  6  =  0  almost  everywhere.  If  all  differential  operators  in 
D  are  of  the  same,  fixed  order,  we  call  the  differential  equation  homogeneous 
order.  Otherwise,  the  equation  is  called  inhomogeneous  order. 

To  handle  such  problems  in  a  computational  environment,  one  commonly 
discretizes  the  continuous  problem.  To  this  end,  a  domain  grid  is  chosen 
and  all  entities  of  the  continuous  partial  differential  equation  (1)  are  discretized 
over  this  domain  grid.  The  result  is  a  system  of  linear  equations 


DkPk  =  bk 


(2) 
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where  pk  denotes  an  approximation  of  the  continuous  solution  p  over  the  grid 
Tfc,  bk  denotes  a  discretization  of  the  boundary  conditions,  and  Dk  is  a  discrete 
approximation  of  the  continuous  differential  operators  D  on  the  domain  grid 

Tjfc. 

Relying  on  the  theory  for  finite  elements  or  finite  differences  [11],  the 
discrete  solutions  pk  can  be  formally  guaranteed  to  converge  to  the  continuous 
solution  p  of  the  original  continuous  problem  (1)  if  the  discretizations  Tk  are 
chosen  carefully  and  the  discrete  representations  Dk  and  bk  are  well  chosen. 

At  this  point,  the  problem  of  finding  the  continuous  solution  p  of  the 
system  of  continuous  partial  differential  equations  (1)  has  been  reduced  to  the 
problem  of  solving  denser  an  denser  systems  of  linear  equations  (2). 

The  links  between  mesh  modeling  and  differential  equations  were  previ¬ 
ously  investigated  by  Mallet  [8],  Taubin  [12],  and  Kobbelt  [5].  The  method 
presented  here  is  new  because  subdivision  schemes  that  model  solutions  of  in¬ 
homogeneous  order  differential  equations  are  precomputed  entirely,  enabling 
very  efficient  modeling  of  shapes  guided  by  inhomogeneous  order  differential 
equations.  In  particular,  the  actual  application  of  the  subdivision  schemes 
does  not  require  any  computational  solving  whatsoever. 

§3.  Subdivision  for  Homogeneous  Order  Differential  Equations 

In  our  previous  work  [13,14]  we  characterized  subdivision  schemes  for  the 
solutions  of  homogeneous  order  linear  partial  differential  equations.  In  this 
framework,  the  subdivision  matrix  ^fc-i  is  determined  as  the  solution  to  the 
system  of  linear  equations 


OkSk-i  =  2^Uk-iDk-i,  (3) 

where  d  is  the  dimension  of  the  domain.  Recall  that  the  differencing  operator 
Dk  is  the  discrete  approximation  of  the  continuous  differential  operator  D  of 
the  original,  continuous  problem  (1)  on  the  level  k  grid  Tfc.  Further,  Uk-i 
denotes  a  very  simple  linear  transformation,  called  replication  or  upsampling, 
that  carries  coefficients  over  the  grid  Tk-i  into  coefficients  over  the  next  denser 
grid  Tfc.  The  action  of  Uk~i  is  very  simple:  Coefficients  centered  over  knots  in 
Tk-i  are  replicated  over  the  same  knots  in  the  denser  grid  Tk  while  coefficients 
centered  over  the  remaining  knots  Tk  —  Tk-i  are  set  to  zero.  Thus,  Uk-i  is  a 
matrix  whose  rows  are  either  zero  or  a  standard  unit  vector,  and  Uk-i  can  be 
constructed  easily  and  efficiently. 

We  visualize  the  meaning  of  equation  (3)  in  Figure  2:  The  subdivision 
matrix  is  determined  so  that  a  certain  commutativity  relationship  holds  be¬ 
tween  subdivision,  upsampling  and  differencing.  Differencing  coefficients  on 
the  coarse  grid  and  upsampling  those  differences  to  the  finer  grid  by  inserting 
zero  for  all  new  grid  points  {Uk-iDk-i,  the  right  hand  side  of  equation  (3)) 
should  yield  the  same  result  as  subdividing  the  coefficients  using  the  subdi¬ 
vision  scheme  and  then  differencing  on  the  finer  grid  {DkSk-iy  the  left  hand 
side  of  equation  (3)). 
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Fig.  2.  The  subdivision  scheme  is  determined  such  that  this  commutativity  re¬ 
lationship  holds. 

The  subdivision  matrices  Sk  are  the  only  unknowns  of  equation  (3),  and 
we  can  use  linear  algebra  to  systematically  solve  for  these  subdivision  matri¬ 
ces.  In  our  previous  work  [13]  we  showed  that  solutions  produced  by  these 
subdivision  matrices  are  related  to  solutions  produced  by  an  interpolating  fi¬ 
nite  element  solver  using  a  simple,  fixed  change  of  basis.  Therefore,  if  the  finite 
element  solver  converges,  then  the  subdivision  solution  is  also  well  defined. 

As  a  side  note,  in  our  previous  work  [13]  we  establish  that  the  right-hand 
side  of  the  system,  as  solved  by  the  subdivision  scheme  from  relation  (3),  is 
DkPk  =  Uk-iUk-2-^-UoDoPo  where  pk  =  Sk-iPk-i  and  po  is  a  user-given 
set  of  initial  control  coefficients.  In  other  words,  the  subdivision  scheme  leads 
to  a  specific  combination  of  the  integer  shifts  of  the  Green  function  of  the 
differential  operator. 

Further,  computation  of  the  subdivision  matrix  Sk-i  based  on  relation 
(3)  requires  the  inversion  of  the  differencing  operator  D*.  Consequently,  the 
computational  work  required  for  finding  the  subdivision  matrix  is  at  least 
the  same  as  inverting  the  finite  difference  system.  However,  later  we  will  see 
that  the  subdivision  matrices  can  be  precomputed.  Thus,  in  contrast  to  a 
conventional  finite  difference  solver,  new  shapes  can  be  generated  extremely 
efficiently. 

As  an  example,  we  briefly  derive  subdivision  schemes  for  piecewise  poly¬ 
nomial  splines.  Recall  from  deBoor  [1]  that  the  piecewise  polynomial  spline 
p[x]  of  degree  m  satisfies  the  differential  equation  Z>[x]”^^^p[x]  =  0  where 
D[x]  denotes  the  first  derivative  in  the  variable  x. 

We  employ  generating  functions  [4]  for  concise  and  convenient  encoding 
of  discrete  coefficient  sequences.  To  this  end,  we  choose  the  domain  grids  for 
our  analysis  as  the  dilates  ^ZZ  of  the  integer  grid  7L,  A  generating  function 
Pk[x]  is  a  power  series  that  associates  the  zth  coefficient  of  the  discrete  shape 
Pk  as  the  coefficient  of  x\  For  example,  the  coefficient  sequence  {1,2, 3, 4,  5} 
is  represented  by  1  +  2x  -t-  3x^  -f  +  hx^. 

Recall  the  definition  of  the  first  derivative  operator. 
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Substituting  t  =  ^  yields 


D[x]p[x]  =  lim 

k—^oo 


p[x]  -  p  [a;  +  ^] 
1 


Thus,  for  X  €  the  approximation  of  the  first  derivative  is  given  by  the 
difference  between  two  adjacent  discretizations,  normalized  by  the  grid  spac¬ 
ing.  In  terms  of  generating  functions,  this  differencing  operation  is  represented 
by  the  Laurent  polynomial 


Higher  order  derivatives  and  differences  are  obtained  by  repeated  application 
of  the  respective  continuous  or  discrete  operator. 

In  terms  of  generating  functions,  the  action  of  the  upsampling  operator 
Uk  can  be  captured  very  concisely:  The  expression  p  [x^]  represents  the  up- 
sampled  coefficient  sequence  of  p[x]  as  a  generating  function.  Thus,  in  our 
example  of  polynomial  splines,  the  generating  function  Sk[x]  for  the  subdivi¬ 
sion  scheme  satisfies 


D,[xr+^s,[x]  = 

which  can  be  simplified  to 


—  2 


Dk-i[x^] 

Dle[x] 


jn+l 


Fortunately, 

Dk-i[x‘^]  1  H-x 

Dk[x]  ~  2  xi/2  ’ 


i.e.  the  generating  functions  for  the  differencing  operations  on  the  level  A:  —  1 
and  level  k  grids  divide  out  yielding  a  simple  expression  independent  of  A:.  As 
a  result,  the  subdivision  mask  for  the  degree  m  polynomial  splines  are  exactly 
the  coefficients  of 


s[x]  =  2 


l-\-x\ 

2x1/2  J 


m+l 


Remarkably,  these  are  precisely  the  known  subdivision  schemes  for  piece- 
wise  linear  functions  (m  =  1),  Chaikin’s  algorithm  [9]  (m  =  2)  and  the 
Lane/Riesenfeld  algorithm  [6]  (m  =  3). 

Previously  we  applied  this  strategy  to  derive  subdivision  schemes  model¬ 
ing  solutions  of  homogeneous  order  linear  differential  equations  yielding  local, 
stationary  subdivision  masks  [13,14].  In  this  paper,  we  show  that  largely  the 
same  strategy  can  be  used  to  determine  subdivision  schemes  for  inhomoge¬ 
neous  order  linear  partial  differential  equations.  As  we  will  see,  in  this  case 
the  actual  subdivision  masks  may  depend  on  the  particular  level  of  subdivi¬ 
sion,  i.e.  are  non-stationary.  However,  the  masks  can  still  be  precomputed  as 
a  closed  form  algebraic  expression  in  the  level  of  subdivision,  which  can  then 
be  evaluated  very  eflficiently  during  the  actual  application  of  the  scheme. 
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§4.  Subdivision  for  Inhomogeneous  Order  Differential  Equations 

In  this  section  we  extend  our  systematic  construction  of  subdivision  schemes  to 
handle  inhomogeneous  order  linear  partial  differential  equations.  We  consider 
the  simple  yet  interesting  problem  of  splines  in  tension  [10].  The  continuous 
spline  in  tension  p\x\  for  tension  parameter  ot  satisfies  the  differential  equation 

(z>[a:]^  -  Dlxf^  p[x]  =  0,  (4) 

where  D[a:]  again  represents  the  continuous  first  derivative  operator  with  re¬ 
spect  to  the  variable  x.  Note  that  equation  (4)  incorporates  both  second  and 
fourth  derivatives  of  p[ar],  i.e.  is  inhomogeneous  order. 

Following  the  same  strategy  as  in  the  derivations  for  polynomial  splines, 
we  use  generating  functions  to  encode  the  discrete  approximation  pk  of  the 
spline  in  tension  on  grid  Tk  as  well  as  for  the  representation  of  the  differencing 
operation  Dk[x]  =  Next,  we  apply  equation  (3)  to  characterize  the 

subdivision  scheme  as  the  solution  to 

-  a^Dklxf'^  5fc_i[rc]  =  2  (^Dk-i  [x^Y  ~  [^‘^]  )  j  (5) 


which  can  be  simplified  to 

2  (^Dk-ilx^t  -  a'^Dk^i[x^f^ 

Sfc-i[a^]  =  „  ,  i4  2  n  r 

Dk[x]  -a^Dk[x] 


(6) 


However,  at  this  point  we  note  that  there  is  no  simple  closed- form  ex¬ 
pression  for  the  quotient  (unless  a  =  0).  In  other  words, 

there  is  no  finitely-supported  subdivision  scheme  sa;_i[x]  for  splines  in  ten¬ 
sion.  Moreover,  the  coefficients  of  the  Laurent  series  expansion  of  the  quotient 
Sfc_i[a^]  depend  on  the  level  of  subdivision  k,  i.e.  the  subdivision  scheme  has 
to  be  non-st  at  ionary. 

Fortunately,  due  to  the  structure  of  equation  (3)  the  coefficients  of  this 
expansion  decrease  very  rapidly  away  from  the  origin.  Thus,  we  can  approxi¬ 
mate  the  infinite  Laurent  expansion  of  the  subdivision  mask  well  by  a  locally 
supported  scheme.  To  this  end,  we  construct  the  generating  function  Sfc-i[a^] 
of  desired  support  symbolically  with  the  actual  coefficients  as  unknowns. 


n 

Sk-i[x]  =  YL 

i=-n 


for  a  user-defined  support  n.  We  then  construct  a  generating  function  for  the 
residual  of  equation  (5), 


rk[x]  =  (^Dklx]'^  -  a^Dklx]^^  5fc_i[a:]  -  2  (^Dk^i  [x"^]  -  Dk-i  [x"^]  ) 

=Y.’'W- 
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Using  linear  algebra,  we  can  now  solve  for  the  unknowns  of  (7)  sym¬ 
bolically  by  minimizing  the  least  squares  residual  of  the  coefficients  r J, .  The 
motivation  behind  our  strategy  is  to  construct  a  best  solution  for  the  charac¬ 
teristic  equation  (3)  of  given  support.  The  results  of  this  process  are  actual, 
symbolic  coefficients  for  the  local  subdivision  scheme  Sfc_i[a:],  depending  on 
the  tension  parameter  a  as  well  as  on  the  level  k.  As  an  example,  the  ap¬ 
proximation  to  (6)  with  the  same  support  as  the  Lane-Riesenfeld  algorithm 
(n  =  2)  has 

2^  *=  (693  2^+^°  *=+891 4*+^  *=  0^+3525  64*=  0^+399  4*+^  *=  a®+333  4*=  a®+26  a^°) 

8  (693  21+12  fc-f891 41+5  k  a2+3861 256'=  ^4+273  23+®*  aC+gTS  16*=  a8+274i+fc  aio+Ta^^) 

as  the  coefficient  for 

(4*+*=+a2)  (693  1024*= +171 4*=  a®+2  (891  256*= +219  4*+^  *=  a^+399  16*=  o'* +7  a®  )) 

4  (693  21+12  *+891 41+5  k  0,2+3861  256*  0^+273  2^+6  *=  ^^+675  16'=  q10+7 


as  the  coefficient  for  and  finally 

2079  4*+®  *=+6039  4*+®  *=  a^  +  1755  16*+^  *=0^+8265  2*+®*=  a®+5235  16*=  q®+2134*+*=  a*°+56a*^ 
8  (693  21+12  fc+891  41+s  fe  ^2+3861  256'“  ^4+273  2^+6  fc  a6+675  16''  a^+274^+>‘  aio+7 

as  the  coefficient  associated  with  x^.  Note  that  for  a  =  0  these  coefficients 
exactly  reduce  to  the  subdivision  scheme  for  natural  cubic  splines  based  on 
the  Lane-Riesenfeld  algorithm. 

During  an  actual  application  of  the  subdivision  scheme,  the  user-defined 
tension  parameter  a  and  the  current  level  of  subdivision  k  are  substituted 
into  the  symbolic  solution  Sfc_i[a:],  yielding  a  simple  generating  function  in 
only  the  variable  x.  The  coefficients  of  this  generating  function  encode  the 
subdivision  masks  for  the  spline  in  tension  for  the  given  tension  parameter  a 
at  the  current  level  k.  Again,  application  of  this  subdivision  scheme  is  very 
efficient.  For  example,  given  a  =  0,  the  above  expression  simplifies  to  the 
generating  function  for  natural  cubic  spline  subdivision,  independent  of  k. 


k  =  1 
k  =  2 
k  =  S 
fc  =  4 
k  =  h 
k  =  6 
k  =  7 
k  =  8 
k  =  9 
k=:10 
k  =  ll 


0.11063 

0.12345 

0.12489 

0.12499 

0.125 

0.125 

0.125 

0.125 

0.125 

0.125 

0.125 


0.55261 

0.52441 

0.50733 

0.50192 

0.50049 

0.50012 

0.50003 

0.50001 

0.5 

0.5 

0.5 


0.88274 

0.80178 

0.76487 

0.75386 

0.75097 

0.75024 

0.75006 

0.75002 

0.75 

0.75 

0.75 


0.55261 

0.52441 

0.50733 

0.50192 

0.50049 

0.50012 

0.50003 

0.50001 

0.5 

0.5 

0.5 


0.11063 

0.12345 

0.12489 

0.12499 

0.125 

0.125 

0.125 

0.125 

0.125 

0.125 

0.125 


Fig,  3.  Subdivision  masks  for  a  =  1,  fc  =  0. 
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Fig.  4.  Splines  in  tension  for  varying  a. 

Figure  3  shows  the  actual  coefficients  of  a  locally  supported  generating 
function  (n  =  2)  for  o;  =  1  and  k  =  1, . . . ,  11.  Coefficients  were  rounded  to 
five  significant  digits.  Note  that  the  coefficient  sequence  rapidly  converges  to 
the  subdivision  scheme  for  natural  cubic  splines.  Indeed,  after  a  few  rounds 
of  subdivision,  a  spline  in  tension  behaves  like  a  natural  cubic  spline  over  a 
denser  initial  grid  with  its  initial  control  coefficients  determined  by  the  first 
few  rounds  of  subdivision. 

Figure  4  depicts  application  of  four  rounds  of  the  local  subdivision  scheme 
(support  n  =  4)  for  a  ranging  from  0  to  5.  The  initial  control  polygon  is  shown 
as  a  thin  line,  while  the  subdivided  curve  is  shown  in  solid.  Note  that  as  a 
is  increased,  the  curve  follows  the  control  polygon  more  closely.  In  the  limit, 
a  — >  oo,  the  curve  is  actually  the  piecewise  linear  interpolant  of  the  initial 
control  points. 

Figure  5  shows  the  least  squares  residuals  of  approximations  of 

different  sizes  for  a  =  1  and  k  =  0  (the  residual  is  largest  for  fc  =  0)  on  a 
logarithmic  scale.  Note  that  for  the  approximation  of  size  n  =  4  the  residual 
is  already  very  small. 

At  a  higher  level,  we  follow  these  steps  in  the  derivation  of  non-stationary 
subdivision  schemes  for  inhomogeneous  order  linear  partial  differential  equa¬ 
tions: 

Starting  from  the  continuous,  inhomogeneous  order,  linear  partial  dif¬ 
ferential  equations  we  discretize  the  continuous  differential  operators  to  yield 
appropriate  differencing  operators  over  the  respective  subdivision  grids  Tk  ■  We 
then  characterize  the  subdivision  scheme  Sk-i  as  the  only  unknown  of  equation 
(3)  using  these  differencing  operators  as  well  as  simple  replication/upsampling. 
Next,  we  construct  a  representation  of  the  subdivision  scheme  Sk-i  in  terms 


Non- Stationary  Subdivision  for  Inhomogeneous  Order  PDF’s 


419 


Fig.  5.  Residuals  of  local  approximations  with  varying  support. 

of  unknowns  and  symbolically  build  a  residual  expression  representing  the  dif¬ 
ference  between  left-hand  side  and  right-hand  side  of  equation  (3).  Finally, 
we  use  linear  algebra  to  solve  symbolically  for  the  unknowns  of  the  subdivi¬ 
sion  scheme  5fc_i,  which  may  depend  on  the  level  of  subdivision  and  possible 
parameters  to  the  original  partial  differential  equations.  As  a  result,  applica¬ 
tion  of  the  subdivision  scheme  only  involves  instantiation  of  these  constants, 
yielding  a  locally  supported,  approximating  subdivision  scheme  for  solutions 
of  the  original  inhomogeneous  order  partial  differential  equations. 

§5.  Summary  and  Conclusion 

In  this  paper,  we  showed  that  subdivision  can  be  used  to  model  solutions 
of  inhomogeneous  order  differential  equations.  Using  the  characterization  of 
the  subdivision  scheme  based  on  the  commutativity  relationship  (3),  we  can 
systematically  solve  for  these  schemes.  Even  though  the  exact  subdivision 
schemes  may  be  globally  supported,  locally  supported  schemes  approximate 
the  solution  well  enough  for  practical  purposes.  Non-stationary  schemes  can 
be  handled  using  the  same  methodology  by  allowing  the  locally  supported  sub¬ 
division  masks  to  change  between  levels.  Because  these  subdivision  schemes 
can  be  precomputed,  the  modeling  of  solutions  of  inhomogeneous  order  linear 
partial  differential  equations  can  be  handled  very  efficiently. 

The  proposed  method  for  modeling  solutions  to  inhomogeneous  order  lin¬ 
ear  differential  equations  is  quite  general  and  promises  to  be  useful  in  a  variety 
of  applications.  First  of  all,  approximations  based  on  local  subdivision  schemes 
are  often  sufficient  for  modeling  applications.  Indeed,  the  approximate  solu¬ 
tions  are  qualitatively  indistinguishable  from  the  exact  solution.  Second,  if  the 
accuracy  of  the  subdivision  solution  is  not  satisfactory,  the  subdivision  scheme 
can  be  used  to  produce  very  good  initial  estimates  for  more  traditional  solu¬ 
tion  methods.  Third,  the  results  of  traditional  solution  methods  often  need  to 
be  refined  locally  for  visualization  and  analysis.  A  local  subdivision  scheme 
can  be  used  to  refine  solutions  to  any  desired  accuracy  and  provide  better 
accuracy  than  traditional  polynomial  fits. 

Acknowledgments.  This  work  was  supported  in  part  under  NSF  grant  num¬ 
ber  CCR-9732344. 


420 


J.  Warren  and  H.  Weimer 


References 

1.  de  Boor,  C.,  A  Practical  Guide  to  Splines.  Springer,  1978. 

2.  Catmull,  E.  and  J.  Clark,  Recursively  generated  B-spline  surfaces  on  ar¬ 
bitrary  topological  meshes.  Computer  Aided  Design  10  (1978),  350-355. 

3.  Doo,  D.  and  M.  Sabin:  Behavior  of  recursive  subdivision  surfaces  near 
extraordinary  points.  Computer  Aided  Design  10  (1978),  356-360. 

4.  Dyn,  N.,  Subdivision  schemes  in  computer  aided  geometric  design,  in 
Advances  in  Numerical  Analysis  II,  W.  Light  (ed.),  Oxford  University 
Press,  1992,  36-104. 

5.  Kobbelt,  L.,  Fairing  by  finite  difference  methods,  in  Mathematical  Meth¬ 
ods  for  Curves  and  Surfaces  II,  M.  Daehlen,  T.  Lyche,  and  L.  L.  Schu- 
maker  (eds.),  Vanderbilt  University  Press,  1998. 

6.  Lane,  J.  and  R.  Riesenfeld,  A  theoretical  development  for  the  computer 
generation  and  display  of  piecewise  polynomial  surfaces,  IEEE  Transac¬ 
tions  on  Pattern  Analysis  and  Machine  Intelligence  2  (1980),  35-46. 

7.  Loop,  C.  T.,  Smooth  Subdivision  Surfaces  Based  on  Triangles,  Master’s 
Thesis,  Department  of  Mathematics,  University  of  Utah,  1987. 

8.  Mallet,  J.-L.,  Discrete  smooth  interpolation.  Transactions  on  Graphics  4 
No.  2  (1985),  74-123. 

9.  Riesenfeld,  R.,  On  Chaikin’s  algorithm.  Computer  Graphics  and  Image 
Processing  4  (1975),  304-310. 

10.  Schweikert,  D.  G.,  An  interpolating  curve  using  a  spline  in  tension,  J. 
Math,  and  Physics  45  (1966),  312-317. 

11.  Shaidurov,  V.  V.,  Multigrid  Methods  for  Finite  Elements,  Kluwer  Aca¬ 
demic  Publishers,  1995. 

12.  Taubin,  G.,  A  signal  processing  approach  to  fair  surface  design.  Proceed¬ 
ings  of  SIGGRAPH  95,  Computer  Graphics  Proceedings,  Annual  Confer¬ 
ence  Series,  Addison  Wesley  (1995),  351-358. 

13.  Warren,  J.  and  H.  Weimer,  Subdivision  schemes  for  variational  problems, 
preprint  available  online  under  http://www.cs.rice.edu/~henrik/. 

14.  Weimer,  H.  and  J.  Warren,  Subdivision  schemes  for  fluid  flow.  Proceed¬ 
ings  of  SIGGRAPH  99,  Computer  Graphics  Proceedings,  Annual  Confer¬ 
ence  Series,  Addison  Wesley  (1995),  111-120. 

Joe  Warren  and  Henrik  Weimer 

Rice  University,  Department  of  Computer  Science 

P.O.  Box  1892 

Houston,  TX  77251-1892,  USA 
{ j warren, henrikjSr ice. edu 


